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SMAD7 AND USES THEREOF 

Field of the Invention 

This invention relates to nucleic acids and encoded polypeptides which interact with 
5 TGF-p superfamily receptors and which are negative regulators of signaling by those receptors. 
The invention also relates to agents which bind the nucleic acids or polypeptides. The invention 
further relates to methods of using such nucleic acids and polypeptides in the treatment and/or 
diagnosis of disease. 

10 Background of the Invention 

During mammalian embryogenesis and adult tissue homeostasis transforming growth 
factor p (TGF-p) performs pivotal tasks in intercellular communication (Roberts et aL, 1993). 
The cellular effects of this pleiotropic factor are exerted by ligand-induced hetero- 
oligomerization of two distantly related type I and type II serine/threonine kinase receptors, 

15 TpR-I and TpR-II, respectively (Lin and Lodish, 1993; Derynck, 1994; Massague and Weis- 
Garcia, 1996; ten Dijke et aL, 1996). The two receptors, which both are required for signaling, 
act in sequence; TpR-I is a substrate for the constitutively active TpR-II kinase (Wrana et aL, 
1994; Weiser et aL, 1995). TGF-p forms part of a large family of structurally related proteins 
which include activins and bone morphogenetic proteins (BMPs) that signal in a similar fashion, 

20 each employing distinct complexes of type I and type II serine/threonine kinase receptors (Lin 
and Lodish, 1993; Derynck, 1994; Massague and Weis-Garcia, 1996; ten Dijke et aL, 1996). 

Genetic studies of TGF-p-like signalling pathways in Drosophila and Caenorhabditis 
elegans have led to the identification of mothers against dpp (Mad) (Sekelsky et aL, 1995) and 
sma (Savage et aL, 1996) genes, respectively. The products of these related genes perform 

25 essential functions downstream of TGF-P-like ligands acting via serine/threonine kinase 
receptors in these organisms (Wiersdorf et aL, 1996; Newfeld et aL, 1996; Hoodless et aL, 
1996). Vertebrate homologs of Mad and sma have been termed Smads (Derynck et aL, 1996) or 
MADR genes (Wrana and Attisano, 1996). Genetic alterations in Smad2 and Smad4/DPC4 have 
been found in specific tumor subsets, and thus Smads may function as tumor suppressor genes 

30 (Hahn et aL, 1996; Riggins et aL, 1996; Eppert et aL, 1996). Smad proteins share two regions of 
high similarity, termed MH1 and MH2 domains, connected with a variable proline-rich sequence 
(Massague, 1996; Derynck and Zhang, 1996). The C-terminal part of Smad2, when fused to a 
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heterologous DNA-binding domain, was found to have transcriptional activity (Liu et aL, 1996; 
Meersseman et aL, 1997). The intact Smad2 protein when fused to a DNA-binding domain, was 
latent, but transcriptional activity was unmasked after stimulation with ligand (Liu et aL, 1996). 

Different Smads specify different responses using functional assays in Xenopus. Whereas 
5 Smadl induces ventral mesoderm, a BMP-like response, Smad2 induces dorsal mesoderm, an 
activin/TGF-p-like response (Graff et aL, 1996; Baker and Harland, 1996; Thomsen, 1996). 
Upon ligand stimulation Smads become phosphorylated on serine and threonine residues; BMP 
stimulates Smadl phosphorylation, whereas TGF-p induces Smad2 and Smad3 phosphorylation 
(Hoodless et aL, 1996; Liu et aL, 1996; Eppert et aL, 1996; Lechleider et aL, 1996; Yingling et 

10 aL, 1996; Zhang et aL, 1996; Macias-Silva et aL, 1996; Nakao et aL, 1996). Thus certain Smads 
are pathway specific. Pathway specific Smads include Smadl, Smad2, Smad3 and SmadS. 

Smad4 is a common component of TGF-0, activin and BMP signaling (Lagna et aL, 
1996; Zhang et aL, 1997 r ; de Winter et aL, 1997). Smad4 phosphorylation has thus far been 
reported only after activin stimulation of transfected cells (Lagna et aL, 1996). After stimulation 

15 with TGF-p or activin Smad4 interacts with Smad2 or Smad3, and upon BMP challenge a 

heteromeric complex of Smad4 and Smadl has been observed (Lagna et aL, 1996). Upon ligand 
stimulation, Smad complexes translocate from the cytoplasm to the nucleus (Hoodless et aL, 
1996; Liu et aL, 1996; Baker and Harland, 1996; Macias-Silva et aL, 1996), where they, in 
combination with DNA-binding proteins, may regulate gene transcription (Chen et aL, 1996). 

20 

Summary of the Invention 

The invention provides isolated Smad7 nucleic acid molecules, unique fragments of those 
molecules, expression vectors containing the foregoing, and host cells transfected with those 
molecules. The invention also provides isolated polypeptides encoded by the Smad7 nucleic 

25 acids and agents which bind such polypeptides, including antibodies. The foregoing can be used 
in the diagnosis or treatment of conditions characterized by the expression of a Smad7 nucleic 
acid or polypeptide, or lack thereof. The invention also provides methods for identifying 
pharmacological agents useful in the diagnosis or treatment of such conditions. Here, we present 
the identification of Smad7, which opposes pathway specific Smads including Smadl, Smad2 

30 and Smad3 and thus is an inhibitor of the TGF-p superfamily signalling pathways. 

According to one aspect of the invention, an isolated nucleic acid molecule is provided. 
The molecule hybridizes under stringent conditions to a molecule consisting of the nucleic acid 
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sequence of SEQ ID NOs:3 or 5. The isolated nucleic acid molecule codes for a polypeptide 
which inhibits TGF-P superfamily signaling. The invention further embraces nucleic acid 
molecules that differ from the foregoing isolated nucleic acid molecules in codon sequence due 
to the degeneracy of the genetic code. The invention also embraces complements of the 
5 foregoing nucleic acids. 

In certain embodiments, the isolated nucleic acid molecule comprises a molecule 
consisting of the nucleic acid sequence of SEQ ID NO: 7 or 8. Preferably, the isolated nucleic 
acid molecule consists of the nucleic acid sequence of SEQ ID NO:7 or 8, or consists essentially 
of the nucleic acid sequence of SEQ ID NO:3 or 5. 

10 According to another aspect of the invention, an isolated nucleic acid molecule is 

provided. The isolated nucleic acid molecule comprises a molecule consisting of a unique 
fragment of SEQ ID NO: 3 between 12 and 1944 nucleotides in length and complements thereof, 
or a unique fragment of SEQ ID NO:5 between 12 and 1875 nucleotides in length and 
complements thereof, provided that the isolated nucleic acid molecule excludes sequences 

15 consisting only of SEQ ID NO: 1 and SEQ ID NO:2. Preferably the isolated nucleic acid 

molecule excludes molecules consisting solely of nucleotide sequences selected from the group 
consisting of accession numbers AA061644 (SEQ ID NO:l), AA022262 (SEQ ID NO:2), 
AA347307, AA348247, 321995, W78627, W40869, AA033426, AA397050, AA016891, and 
C851 15. In one embodiment, the isolated nucleic acid molecule consists of between 12 and 32 

20 contiguous nucleotides of SEQ ID NO:3, SEQ ID NO:5, or complements of such nucleic acid 
molecules. In preferred embodiments, the unique fragment is at least 14, 15, 16, 17, 18, 20 or 22 
contiguous nucleotides of the nucleic acid sequence of SEQ ID NO:3, SEQ ID NO:5, SEQ ID 
NO:7, SEQ ID NO:8, or complements thereof. 

According to another aspect of the invention, the invention involves expression vectors, 

25 and host cells transformed or transfected with such expression vectors, comprising the nucleic 
acid molecules described above. 

According to still other aspects of the invention, transgenic non-human animals are 
provided. The animals include in certain embodiments the foregoing expression vectors. In 
certain preferred embodiments, the transgenic non-human animal includes a conditional Smad7 

30 expression vector, such as an expression vector that increases expression of Smad7 in a tissue 
specific, development stage specific, or inducible manner. In other embodiments, the transgenic 
non-human animal has reduced expression of Smad7 nucleic acid molecules. In some 
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embodiments, the transgenic non-human animal includes a Smad7 gene disrupted by 
homologous recombination. The disruption can be homozygous or heterozygous. In other 
embodiments, the transgenic non-human animal includes a conditional Smad7 gene disruption, 
such as one mediated by e.g. tissue specific, development stage specific, or inducible, expression 
5 of a recombinase. In yet other embodiments, the transgenic non-human animal includes a trans- 
acting negative regulator of Smad7 expression, such as antisense Smad7 nucleic acid molecules, 
nucleic acid molecules which encode dominant negative Smad7 proteins, Smad7 directed 
ribozymes , etc. 

According to another aspect of the invention, an isolated polypeptide is provided. The 
10 isolated polypeptide is encoded by the isolated nucleic acid molecule of any of claims 1, 2, 3, 4, 
5 or 6, and the polypeptide inhibits TGF-p superfamily signaling activity. 

In other embodiments, the isolated polypeptide consists of a fragment or variant of the 
foregoing which retains the activity of the foregoing. In preferred embodiments, the fragment is 
a C-terminal fragment of Smad7, preferably amino acids 204-426 of SEQ ID NO:4 or SEQ ID 
15 NO:6, or a N-terminal fragment of Smad7, preferably amino acids 2-261 of SEQ ID NO:4 or 
SEQ ID NO:6. 

According to another aspect of the invention, there are provided isolated polypeptides 
which selectively bind a Smad7 protein or fragment thereof, provided that the isolated 
polypeptide is not a TGF-P superfamily type I receptor (e.g., a TGF-p, activin or BMP receptor). 

20 The isolated polypeptide in certain embodiments binds to a polypeptide encoded by the isolated 
nucleic acid molecule of any of claims 1, 2, 3, 4, 5 or 6. In preferred embodiments, isolated 
binding polypeptides include antibodies and fragments of antibodies (e.g., Fab, F(ab) 2 , Fd and 
antibody fragments which include a CDR3 region which binds selectively to an epitope defined 
by the Smad7 polypeptides of the invention, such as SEQ ID NOs:4 or 6). In other preferred 

25 embodiments, the polypeptide is an antibody or fragment thereof which selectively binds an 
epitope defined by a polypeptide selected from the group consisting of SEQ ID NO: 1 1 , SEQ ID 
NO:12, SEQ ID NO:13, and SEQ ID NO:14. In still other preferred embodiments, the isolated 
polypeptide is a monoclonal antibody, a humanized antibody or a chimeric antibody. 

30 The invention provides in another aspect an isolated complex of polypeptides. The 

isolated complex includes a TGF-p superfamily receptor selected from the group consisting of 
activated TGFp superfamily type I receptors and complexes of TGFp superfamily type I 
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receptors and TGFp superfamily type II receptors (e.g. a TGF-p, activin, Vgl or BMP receptors) 
bound to a polypeptide as claimed in claim 1. Preferably the isolated complex includes a 
polypeptide having the amino acid sequence of SEQ ID NO:4 or SEQ ID NO:6. In other 
preferred embodiments, the receptor is selected from the group consisting of TpRI, BMPR-IA, 
5 BMPR-IB, ActR-IA, a complex of TpRI and TpRII, a complex of BMPR-IA and BMPR-II, a 
complex of BMPR-IB and BMPR-II, a complex of ActR-IA and BMPR-II and a complex of 
ActR-IA and ActR-II. 

According to still another aspect of the invention, methods for reducing TGF-P 
superfamily signal transduction in a mammalian cell are provided. The methods involve 

10 contacting a mammalian cell with an amount of an inhibitor of TGF-p superfamily signal 

transduction effective to reduce such signal transduction in the mammalian cell. Preferably the 
TGF-p superfamily signal transduction is mediated by a TGFp superfamily ligand, particularly 
TGF-p 1, activin, Vgl, BMP-4 and/or BMP-7. Other methods are provided for reducing 
phosphorylation of pathway specific Smads (e.g. Smadl, Smad2, Smad3 and/or Smad5) by 

15 contacting a mammalian cell with the inhibitor disclosed above. In certain embodiments of the 
foregoing methods, the inhibitor is an isolated Smad7 polypeptide or a fragment thereof, such as 
a polypeptide encoded by a nucleic acid which hybridizes under stringent conditions to SEQ ID 
NO:3 or 5, nucleic acids which encode the polypeptide of SEQ ID NO:4 or SEQ ID NO:6 or 
degenerates or complements thereof. In some embodiments the nucleic acid encodes amino 

20 acids 204-426 of SEQ ID NO:4 or SEQ ID NO:6 or amino acids 2-261 of SEQ ID NO:4 or SEQ 
ID NO:6. In still other embodiments, the inhibitor is an isolated Smad8 polypeptide or a 
fragment thereof. 

According to still another aspect of the invention, methods for modulating proliferation 
and/or differentiation of a cell are provided. The methods involve contacting a cell with an 
25 amount of an isolated Smad7 polypeptide,or a nucleic acid encoding and expressing such a 

polypeptide, as described above, effective to modulate the proliferation and/or differentiation of 
the cell. 

The invention in a further aspect provides methods for increasing TGF-P superfamily 
signal transduction in a mammalian cell. The mammalian cell is contacted with an agent that 
30 selectively binds to an isolated nucleic acid molecule of the invention or an expression product 
thereof in an amount effective to increase TGF-P superfamily signal transduction. Preferably the 
TGF-P superfamily signal transduction is mediated by a TGFp superfamily ligand selected from 
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the group consisting of TGF-pl, activin, Vgl, BMP-4 and BMP-7. Preferred agents are 
antisense nucleic acids, including modified nucleic acids, and polypeptides including antibodies 
which bind to the polypeptide including the amino acids of SEQ ID NO:4 , the polypeptide 
including the amino acids of SEQ ID NO:6, the polypeptide including the amino acids of SEQ 
5 ID NO: 1 3, the polypeptide including the amino acids of SEQ ID NO: 14, a N-terminal fragment 
of Smad7 or a Oterminal fragment of Smad7, and dominant negative variants of the polypeptide 
of SEQ ID NO:4 or SEQ ID NO:6. 

The invention in still another aspect provides compositions comprising a Smad7 
polypeptide and a pharmaceutically acceptable carrier. 

10 The invention in a further aspect involves methods for decreasing Smad7 TGF-|3 

superfamily inhibitory activity in a subject. An agent that selectively binds to an isolated nucleic 
acid molecule of the invention or an expression product thereof is administered to a subject in 
need of such treatment, in an amount effective to decrease TGFp superfamily signal transduction 
inhibitory activity of Smad7 in the subject. Preferably the TGFp superfamily signal transduction 

15 is mediated by a TGFP superfamily ligand selected from the group consisting of TGF-pi, 
activin, Vgl, BMP-4 and BMP-7. Preferred agents are antisense nucleic acids, including 
modified nucleic acids, and polypeptides including antibodies which bind to the polypeptide 
including the amino acids of SEQ ID NO:4 , the polypeptide including the amino acids of SEQ 
ID NO: 6, the polypeptide including the amino acids of SEQ ID NO: 13, the polypeptide including 

20 the amino acids of SEQ ID NO: 14, a N-terminal fragment of Smad7 or a C-terminal fragment of 
Smad7, and dominant negative variants of the polypeptide of SEQ ID NO:4 or SEQ ID NO:6. 

In another aspect the invention provides methods for diagnosing induction of a TGF-p 
superfamily ligand in a cell. The methods include the steps of (a) measuring the amount of 
Smad7 RNA or polypeptide in the cell and comparing the result of step (a) with a control. 

25 According to still another aspect of the invention, methods are provided for determining 

the presence of a functional TGFp superfamily receptor in a cell. The methods include 
contacting the cell with an amount of TGFp superfamily ligand effective to increase the amount 
of Smad7 in the cell, measuring the amount of Smad7 RNA or polypeptide in the cell, and 
comparing the result of the measurement with a control, wherein an increased amount of Smad7 

30 RNA or polypeptide in the cell indicates the presence of a functional TGFp superfamily receptor. 
Preferably the TGFp superfamily receptor is selected from the group consisting of TGFp 
superfamily type I receptors, TGFp superfamily type II receptors, and complexes of TGFp 
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superfamily type I receptors and TGFp superfamily type II receptors. 

According to another aspect of the invention, methods are provided for identifying lead 
compounds for a pharmacological agent useful in the diagnosis or treatment of disease associated 
with TGFP superfamily signal transduction inhibitory activity of Smad7. One set of methods 
5 involves forming a mixture of a Smad7 polypeptide, a TGF-p superfamily receptor complex or 
an activated TGFp superfamily type I receptor, and a candidate pharmacological agent. The 
mixture is incubated under conditions which, in the absence of the candidate pharmacological 
agent, permit a first amount of specific binding of the TGF-p superfamily receptor complex or 
activated TGFp superfamily type I receptor by the Smad7 polypeptide. A test amount of the 

10 specific binding of the TGF-P superfamily receptor complex or activated type I receptor by the 
Smad7 polypeptide then is detected. Detection of an increase in the foregoing activity in the 
presence of the candidate pharmacological agent indicates that the candidate pharmacological 
agent is a lead compound for a pharmacological agent which increases the TGF-p superfamily 
signal transduction inhibitory activity of Smad7. Detection of a decrease in the foregoing 

15 activities in the presence of the candidate pharmacological agent indicates that the candidate 
pharmacological agent is a lead compound for a pharmacological agent which decreases the 
TGF-p superfamily signal transduction inhibitory activity of Smad7. Another set of methods 
involves forming a mixture as above, adding further a pathway specific Smad polypeptide, and 
detecting first and test amounts of TGF-P superfamily induced phosphorylation of the pathway 

20 specific Smad polypeptide. Detection of an increase in the phosphorylation in the presence of 
the candidate pharmacological agent indicates that the candidate pharmacological agent is a lead 
compound for a pharmacological agent which decreases the TGF-P superfamily signal 
transduction inhibitory activity of Smad7. Detection of a decrease in the foregoing activities in 
the presence of the candidate pharmacological agent indicates that the candidate pharmacological 

25 agent is a lead compound for a pharmacological agent which increases the TGF-P superfamily 
signal transduction inhibitory activity of Smad7. Preferred Smad7 polypeptides include the 
polypeptides of claim 18. 

In the foregoing compositions and methods, preferred members of the TGF-p 
superfamily are TGF-p 1, activin, Vgl, BMP-4 and BMP-7, and the preferred pathway specific 

30 Smad polypeptides are Smadl, Smad2, Smad3 and Smad5. 

According to yet another aspect of the invention, a method for reducing expression of a 
Smad6 or Smad7 nucleic acid or expression product thereof in a cell is provided. The method 
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includes contacting the cell with an amount of an agent which binds selectively to Smad4 
effective to reduce the expression of the Smad6 or Smad7 nucleic acid or expression product 
thereof in the cell. In certain embodiments, the agent is an antisense Smad4 molecule, or an 
antibody that selectively binds to Smad4. 
5 In another aspect, the invention provides a method for increasing Smad6 or Smad7 

expression in a cell. The method includes contacting the cell with an agent selected from the 
group consisting of activin, epidermal growth factor and phorbol esters in an amount effective to 
increase Smad6 or Smad7 expression in the cell. 

According to another aspect of the invention, a method for treating a subject having lung 

10 cancer characterized by elevated expression of a Smad6 gene or a Smad7 gene is provided. The 
method includes administering to the subject an amount of an antisense nucleic acid which binds 
to the expression product of the Smad 6 or Smad7 gene effective to reduce the expression of the 
Smad 6 or Smad7 gene. In other embodiments, the method includes administering a 
polypeptide, such as an antibody or fragment thereof which binds a polypeptide selected from the 

15 group consisting of a polypeptide comprising the amino acid sequence of SEQ ID NO:4, a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 6, aN-terminal fragment of 
Smad7 and a C-terminal fragment of Smad7, a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 10, a polypeptide comprising the amino acid sequence of SEQ ID NO:l 1, a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 12, a polypeptide comprising 

20 the amino acid sequence of SEQ ID NO: 13, and a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 14. In still other embodiments, the agent is a dominant negative variant 
of Smad6 or Smad7. 

According to yet another aspect of the invention, methods for reducing eye defects in a 
developing mammalian embryo are provided. The methods include contacting the cells of the 

25 embryo with an agent which reduces the expression or activity of a Smad7 nucleic acid molecule 
or an expression product thereof. In certain embodiments, the agent selectively binds the Smad7 
nucleic acid molecule or an expression product thereof. In preferred embodiments, the agent is 
an antisense nucleic acid molecule or a polypeptide, and preferably the polypeptide is an 
antibody or fragment thereof which binds a polypeptide selected from the group consisting of a 

30 polypeptide comprising the amino acid sequence of SEQ ID NO: 4, a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 6, a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 13, a polypeptide comprising the amino acid sequence of SEQ ID NO: 14, a N- 
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terminal fragment of Smad7 and a C-terminal fragment of Smad7. The agent also can be a 
dominant negative variant of Smad7. 

According to another aspect of the invention, an isolated polypeptide is provided. The 
polypeptide includes a first polypeptide or fragment thereof linked to a second polypeptide or 
5 fragment thereof, wherein the second polypeptide or fragment thereof comprises a MH2 domain 
of Smad7. The polypeptide is localized in the nucleus of a cell and is exported from the nucleus 
to the cytoplasm of the cell upon TGFP superfamily receptor-mediated signal transduction in 
cells having a TGFp superfamily receptor. 

According to another aspect of the invention, a fusion protein is provided which includes 

10 a Smad7 MH2 domain or a nuclear localization fragment thereof. In other aspects of the 

invention, a fusion protein is provided which includes a Smad7 MH2 domain or a transcriptional 
activation fragment thereof. 

According to still another aspect of the invention, methods for modulating transcription 
of Smad7 -regulated gene transcription are provided. The methods include contacting a 

15 mammalian cell with an agent which modulates TGFp superfamily receptor-mediated signal 
transduction in an amount effective to modulate Smad7 -regulated gene transcription. In some 
embodiments the agent is a TGFp superfamily ligand or an inhibitor of TGFp superfamily 
receptor-mediated signal transduction. 

In another aspect of the invention, an isolated nucleic acid molecule is provided. The 

20 nucleic acid molecule includes the nucleotide sequence of SEQ ID NO: 1 5, an allelic variant 
thereof, or a functional fragment thereof which confers TGFp regulation. Nucleic acid 
molecules which include the nucleotide sequences which hybridize under stringent conditions to 
SEQ ID NO: 15 also are provided. In still other aspects of the invention, isolated nucleic acid 
molecules which include a unique fragment of SEQ ID NO: 15 are provided. Expression vectors 

25 which include the foregoing isolated nucleic acid molecules including or related to SEQ ID 
NO: 15 also are provided 

According to another aspect of the invention, method for regulating transcription of a first 
nucleic acid molecule are provided. The methods include preparing a construct comprising the 
first nucleic acid molecule operably linked to the foregoing isolated nucleic acid molecules 

30 including or related to SEQ ID NO: 15, and introducing the construct into an expression system. 
In some embodiments the expression system is a cell. In preferred embodiments, the cell 
expresses a TGFp superfamily receptor, and the method includes contacting the cell with a TGFp 
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superfamily ligand to increase expression of the first nucleic acid molecule. 

In still other aspects of the invention, methods for identifying modulators of TGF(3- 
regulated transcriptional activity are provided. The methods include providing an expression 
system with a reporter construct including SEQ ID NO: 15 or a TGFp-regulated fragment thereof 
5 operably linked to a nucleic acid encoding a detectable expression product, and contacting the 
expression system with a candidate modulator compound. The expression system is incubated 
under conditions which, in the absence of the candidate modulator, permit a first amount of 
expression of the detectable expression product. A test amount of the expression the detectable 
expression product then is detected. Detection of an increase in the foregoing activity in the 

10 presence of the candidate modulator compound indicates that the candidate modulator compound 
is a compound which increases TGFp-regulated transcriptional activity. Detection of a decrease 
in the foregoing activities in the presence of the candidate modulator compound indicates that the 
candidate modulator compound is a compound which decreases TGFp-regulated transcriptional 
activity. In preferred embodiments the expression system is a cell or an in vitro transcription 

15 system. Other other preferred embodiments, the detectable expression product is a reporter 
protein, such as an enzyme, e.g., luciferase, or a green fluorescent protein. 

The use of the foregoing compositions in the preparation of a medicament is also 
contemplated. 

These and other aspects of the invention will be described in further detail in connection 
20 with the detailed description of the invention. 

Brief Description of the Figures 

Fig. 1 depicts the cloning and tissue distribution of Smad7. 

Fig. 2 shows that Smad7 inhibits TGF-P and activin induced transcriptional responses. 
25 Fig. 3 demonstrates the association of Smad7 with the TGF-p receptor complex. 

Fig. 4 shows that Smad7 is not phosphorylated upon TpR-activation, but Smad7 
overexpression inhibits TGF-P-mediated Smad2 and Smad3 phosphorylation. 
Fig. 5 shows that Smad7 is an early response gene for TGF-P . 

Fig. 6 demonstrates the association of Smad7 with BMP and activin receptor complexes. 
30 Fig. 7 shows that Smad7 inhibits BMP- and activin-receptor mediated phosphorylation of 

Smadl or SmadS. 

Fig. 8 demonstrates that the C-terminal domain of Smad7 associates with the TGF-p 
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receptor complex and that Smad8 and the N-terminal and C-terminal domains of Smad7 inhibit 
TGF-p signalling. 

Fig. 9 shows a comparison of Smad6 and Smad7. A: comparison of the amino acid 
sequences. B: Pairwise alignment relationship between hSmadl through hSmad7 and hSmad9. 
5 C: Expression of Smad6 and Smad7 mRNA in lung carcinoma cell lines. 

Fig. 10 shows that TGF-B superfamily members induce Smad6 and Smad7 mRNA 

levels. 

Fig. 1 1 shows that anti-sense Smad7 mRNA potentiates TGF-p 1 transcriptional response. 
Fig. 12 depicts the effect of EGF and PMA on Smad6 and Smad7 mRNA expression. 
10 Fig. 13 shows that Smad7 inhibits TGF-p-mediated signaling responses more effectively 

than Smad8. 

Fig. 14 shows that overexpression of Smad7 in Xenopus embryos induces formation of a 
partial secondary axis and eye defects. 

Fig. 1 5 shows characterization of expression of Smad7 and Smad6 in stable pMEP-4- 
1 5 driven transfected clones . 

Fig. 16 shows that Smad7 inhibits TGF-p 1 -mediated growth inhibition (A-C) Effective 
of Smad7 on TGF-p 1 -induced growth inhibition in three independent clones (D) Effective of 
Smad6S and (E) Smad6L on TGF-p 1 -induced growth inhibition. 

Fig. 17 shows that Smad7 inhibits TGF-p 1 -mediated induction of early response genes. 
20 Fig. 1 8 shows the subcellular distribution of Smad7 in the absence or presence of TGF- 

pl receptor activation. 

Fig. 19 shows the subcellular distribution of wild-type Smad7 and Smad7 deletion 
mutants in the absence or presence of TGF-p 1 stimulation. 

Fig. 20 depicts the results of a luciferase assays showing TGF-p 1 inducibility of a Smad7 
25 promoter fragment. 

Fig. 21 depicts several restriction enzyme sites and putative binding sites for different 
transcription factors and Smad proteins in the ~725bp Bam Hl-Xho I minimal promoter fragment 
(SEQ ID NO: 15). 



30 



Brief Description of the Sequences 

SEQ ID NO:l is the nucleotide sequence of the mouse EST (accession number 
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AA061644) which has sequence similarity to N-terminal Smad domains. 

SEQ ID NO:2 is the nucleotide sequence of the mouse EST (accession number 
AA022262) which has sequence similarity to C-terminal Smad domains. 

SEQ ID NO: 3 is the nucleotide sequence of the mouse Smad7 cDNA. 
5 SEQ ID NO:4 is the amino acid sequence of the mouse Smad7 protein. 

SEQ ID NO: 5 is the nucleotide sequence of the human Smad7 cDNA. 

SEQ ID NO: 6 is the amino acid sequence of the human Smad7 protein. 

SEQ ID NO: 7 is the nucleotide sequence of the coding region of the mouse Smad7 

cDNA. 

10 SEQ ID NO: 8 is the nucleotide sequence of the coding region of the human Smad7 

cDNA. 

SEQ ID NO:9 is the nucleotide sequence of the human Smad6 cDNA. 
SEQ ID NO: 10 is the amino acid sequence of a Smad7 preferred peptide to which an 
antibody can be raised. 

15 SEQ ID NO: 1 1 is the amino acid sequence of a Smad7 preferred peptide to which an 

antibody can be raised. 

SEQ ID NO: 12 is the amino acid sequence of a Smad7 preferred peptide to which an 
antibody can be raised. 

SEQ ID NO: 13 is the amino acid sequence of a Smad7 preferred peptide to which an 
20 antibody can be raised. 

SEQ ID NO: 14 is the amino acid sequence of a Smad7 preferred peptide to which an 
antibody can be raised. 

SEQ ID NO: 15 is the 725 nucleotide Bam Hl-Xho I Smad7 minimal promoter fragment. 



25 

Detailed Description of the Invention 

The present invention in one aspect involves the cloning of a cDNA encoding a Smad7 
TpR-I receptor-interacting protein. The sequence of the mouse gene is presented as 
SEQ ID NO:3 ? and the predicted amino acid sequence of this gene's protein product is presented 
30 as SEQ ID NO:4. The sequence of the human gene is presented as SEQ ID NO: 5, and the 
predicted amino acid sequence of this gene's protein product is presented as SEQ ID NO:6. 
Analysis of the sequence by comparison to nucleic acid and protein databases determined that 
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Smad7 has a C-terminal domain (the MH2 domain) which is related to other Smad proteins. The 
Smad7 C-terminal domain is most related to Smad6 (48% identity). 

The invention thus involves in one aspect Smad7 polypeptides, genes encoding those 
polypeptides, functional modifications and variants of the foregoing, useful fragments of the 
5 foregoing, as well as therapeutics relating thereto. The expression of these genes affects and is 
affected by TGF-p superfamily expression and signal transduction. The TGF-P superfamily 
members are well known to those of ordinary skill in the art and include TGF-ps, activins, bone 
morphogenetic proteins (BMPs), Vgl, Mullerian inhibitory substance (MIS) and 
growth/differentiation factors (GDFs). 

10 Homologs and alleles of the Smad7 nucleic acids of the invention can be identified by 

conventional techniques. Thus, an aspect of the invention is those nucleic acid sequences which 
code for Smad7 polypeptides and which hybridize to a nucleic acid molecule consisting of the 
coding region of SEQ ID NO: 3 or SEQ ID NO:5, under stringent conditions. The term 
"stringent conditions" as used herein refers to parameters with which the art is familiar. Nucleic 

15 acid hybridization parameters may be found in references which compile such methods, e.g. 
Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, or Current Protocols in 
Molecular Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. More 
specifically, stringent conditions, as used herein, refers, for example, to hybridization at 65°C in 

20 hybridization buffer (3.5 x SSC, 0.02% Ficoll, 0.02% polyvinyl pyrrolidone, 0.02% Bovine 
Serum Albumin, 2.5mM NaH 2 P0 4 (pH7), 0.5% SDS, 2mM EDTA). SSC is 0.1 5M sodium 
chloride/0. 15M sodium citrate, pH7; SDS is sodium dodecyl sulphate; and EDTA is 
ethylenediaminetetracetic acid. After hybridization, the membrane upon which the DNA is 
transferred is washed at 2 x SSC at room temperature and then at 0.1 x SSC/0.1 x SDS at 

25 temperatures up to 65°C. 

There are other conditions, reagents, and so forth which can be used, which result in a 
similar degree of stringency. The skilled artisan will be familiar with such conditions, and thus 
they are not given here. It will be understood, however, that the skilled artisan will be able to 
manipulate the conditions in a manner to permit the clear identification of homologs and alleles 

30 of Smad7 nucleic acids of the invention. The skilled artisan also is familiar with the 

methodology for screening cells and libraries for expression of such molecules which then are 
routinely isolated, followed by isolation of the pertinent nucleic acid molecule and sequencing. 
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In general homologs and alleles typically will share at least 40% nucleotide identity 
and/or at least 50% amino acid identity to SEQ ID NOs:3 or 5 and SEQ ID NOs:4 or 6, 
respectively, in some instances will share at least 50% nucleotide identity and/or at least 65% 
amino acid identity and in still other instances will share at least 60% nucleotide identity and/or 
5 at least 75% amino acid identity. Watson-Crick complements of the foregoing nucleic acids also 
are embraced by the invention. 

In screening for Smad7 proteins, a Southern blot may be performed using the foregoing 
conditions, together with a radioactive probe. After washing the membrane to which the DNA is 
finally transferred, the membrane can be placed against X-ray film to detect the radioactive 
10 signal. 

The invention also includes degenerate nucleic acids which include alternative codons to 
those present in the native materials. For example, serine residues are encoded by the codons 
TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the purposes of 
encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the art that any of 

15 the serine-encoding nucleotide triplets may be employed to direct the protein synthesis apparatus, 
in vitro or in vivo, to incorporate a serine residue into an elongating Smad7 polypeptide. 
Similarly, nucleotide sequence triplets which encode other amino acid residues include, but are 
not limited to: CCA, CCC, CCG and CCT (proline codons); CGA, CGC, CGG, CGT, AGA and 
AGG (arginine codons); AC A, ACC, ACG and ACT (threonine codons); AAC and AAT 

20 (asparagine codons); and ATA, ATC and ATT (isoleucine codons). Other amino acid residues 
may be encoded similarly by multiple nucleotide sequences. Thus, the invention embraces 
degenerate nucleic acids that differ from the biologically isolated nucleic acids in codon 
sequence due to the degeneracy of the genetic code. 

The invention also provides isolated unique fragments of SEQ ID NOs:3 or 5 or 

25 complements of SEQ ID NOs:3 or 5. A unique fragment is one that is a 'signature' for the larger 
nucleic acid. It, for example, is long enough to assure that its precise sequence is not found in 
molecules outside of the Smad7 nucleic acids defined above. A unique fragment excludes, by 
definition, sequences consisting solely of EST sequences such as those described by SEQ ID 
NOsil and 2. Unique fragments can be used as probes in Southern blot assays to identify such 

30 nucleic acids, or can be used in amplification assays such as those employing PCR. As known to 
those skilled in the art, large probes such as 200 nucleotides or more are preferred for certain 
uses such as Southern blots, while smaller fragments will be preferred for uses such as PCR. 
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Unique fragments also can be used to produce fusion proteins for generating antibodies or 
determining binding of the polypeptide fragments, as demonstrated in the Examples, or for 
generating immunoassay components. Likewise, unique fragments can be employed to produce 
nonfused fragments of the Smad7 polypeptides such as the N-terminal and C-terminal fragments 
5 disclosed herein, useful, for example, in the preparation of antibodies, in immunoassays, and as a 
competitive binding partner of the TGF-p, activin, BMP receptors and/or other polypeptides 
which bind to the Smad7 polypeptides, for example, in therapeutic applications. Unique 
fragments further can be used as antisense molecules to inhibit the expression of Smad7 nucleic 
acids and polypeptides, particularly for therapeutic purposes as described in greater detail below. 

10 As will be recognized by those skilled in the art, the size of the unique fragment will 

depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO: 3 and/or 
SEQ ID NO: 5 and its complement will require longer segments to be unique while others will 
require only short segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 bases long). Virtually any segment of 

15 SEQ ID NO:7 or SEQ ID NO: 8, or complements thereof, that is 18 or more nucleotides in length 
will be unique. Those skilled in the art are well versed in methods for selecting such sequences, 
typically on the basis of the ability of the unique fragment to selectively distinguish the sequence 
of interest from non-Smad7 nucleic acids. A comparison of the sequence of the fragment to 
those on known data bases typically is all that is necessary, although in vitro confirmatory 

20 hybridization and sequencing analysis may be performed. 

A unique fragment can be a functional fragment. A functional fragment of a nucleic acid 
molecule of the invention is a fragment which retains some functional property of the larger 
nucleic acid molecule, such as coding for a functional polypeptide, binding to proteins, 
regulating transcription of operably linked nucleic acids, and the like. One of ordinary skill in 

25 the art can readily determine using the assays described herein and those well known in the art to 
determine whether a fragment is a functional fragment of a nucleic acid molecule using no more 
than routine experimentation. 

As mentioned above, the invention embraces antisense oligonucleotides that selectively 
bind to a nucleic acid molecule encoding a Smad7 polypeptide, to increase TGF-p, activin and/or 

30 BMP signaling by reducing the amount of Smad7. This is desirable in virtually any medical 
condition wherein a reduction of Smad7 is desirable, e.g., to increase TGF-P signaling. 
As used herein, the term "antisense oligonucleotide" or "antisense" describes an 
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oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under physiological 
conditions to DNA comprising a particular gene or to an mRNA transcript of that gene and, 
thereby, inhibits the transcription of that gene and/or the translation of that mRNA. The 
5 antisense molecules are designed so as to interfere with transcription or translation of a target 
gene upon hybridization with the target gene or transcript. Those skilled in the art will recognize 
that the exact length of the antisense oligonucleotide and its degree of complementarity with its 
target will depend upon the specific target selected, including the sequence of the target and the 
particular bases which comprise that sequence. It is preferred that the antisense oligonucleotide 

10 be constructed and arranged so as to bind selectively with the target under physiological 

conditions, i.e., to hybridize substantially more to the target sequence than to any other sequence 
in the target cell under physiological conditions. Based upon SEQ ID NOs:3, 5 or 9, or upon 
allelic or homologous genomic and/or cDNA sequences, one of skill in the art can easily choose 
and synthesize any of a number of appropriate antisense molecules for use in accordance with the 

15 present invention. In order to be sufficiently selective and potent for inhibition, such antisense 
oligonucleotides should comprise at least 10 and, more preferably, at least 15 consecutive bases 
which are complementary to the target, although in certain cases modified oligonucleotides as 
short as 7 bases in length have been used successfully as antisense oligonucleotides (Wagner et 
al., Nature BiotechnoL 14:840-844, 1996). Most preferably, the antisense oligonucleotides 

20 comprise a complementary sequence of 20-30 bases. Although oligonucleotides may be chosen 
which are antisense to any region of the gene or mRNA transcripts, in preferred embodiments the 
antisense oligonucleotides correspond to N-terminal or 5' upstream sites such as translation 
initiation, transcription initiation or promoter sites. In addition, 3 '-untranslated regions may be 
targeted. Targeting to mRNA splicing sites has also been used in the art but may be less 

25 preferred if alternative mRNA splicing occurs. In addition, the antisense is targeted, preferably, 
to sites in which mRNA secondary structure is not expected (see, e.g., Sainio et al., Cell Mol 
Neurobiol. 14(5):439-457, 1994) and at which proteins are not expected to bind. Finally, 
although SEQ ID Nos:3, 5 and 9 disclose cDNA sequences, one of ordinary skill in the art may 
easily derive the genomic DNA corresponding to the cDNA of SEQ ID Nos:3, 5 or 9. Thus, the 

30 present invention also provides for antisense oligonucleotides which are complementary to the 
genomic DNA corresponding to SEQ ID Nos:3, 5 and 9. Similarly, antisense to allelic or 
homologous cDNAs and genomic DNAs are enabled without undue experimentation. 
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In one set of embodiments, the antisense oligonucleotides of the invention may be 
composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof That 
is, the 5 f end of one native nucleotide and the 3 T end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester internucleoside linkage. These 
5 oligonucleotides may be prepared by art recognized methods which may be carried out manually 
or by an automated synthesizer. They also may be produced recombinantly by vectors. 

In preferred embodiments, however, the antisense oligonucleotides of the invention also 
may include "modified" oligonucleotides. That is, the oligonucleotides may be modified in a 
number of ways which do not prevent them from hybridizing to their target but which enhance 

10 their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

The term "modified oligonucleotide" as used herein describes an oligonucleotide in 
which (1) at least two of its nucleotides are covalently linked via a synthetic internucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one nucleotide 
and the 3' end of another nucleotide) and/or (2) a chemical group not normally associated with 

15 nucleic acids has been covalently attached to the oligonucleotide. Preferred synthetic 
internucleoside linkages are phosphorothioates, alkylphosphonates, phosphorodithioates, 
phosphate esters, alkylphosphonothioates, phosphoramidates, carbamates, carbonates, phosphate 
triesters, acetamidates, carboxymethyl esters and peptides. 

The term "modified oligonucleotide" also encompasses oligonucleotides with a 

20 covalently modified base and/or sugar. For example, modified oligonucleotides include 

oligonucleotides having backbone sugars which are covalently attached to low molecular weight 
organic groups other than a hydroxyl group at the 3' position and other than a phosphate group at 
the 5' position. Thus modified oligonucleotides may include a 2'-0-alkylated ribose group. In 
addition, modified oligonucleotides may include sugars such as arabinose instead of ribose. The 

25 present invention, thus, contemplates pharmaceutical preparations containing modified antisense 
molecules that are complementary to and hybridizable with, under physiological conditions, 
nucleic acids encoding Smad7 polypeptides, together with pharmaceutically acceptable carriers. 

Antisense oligonucleotides may be administered as part of a pharmaceutical composition. 
Such a pharmaceutical composition may include the antisense oligonucleotides in combination 

30 with any standard physiologically and/or pharmaceutically acceptable carriers which are known 
in the art. The compositions should be sterile and contain a therapeutically effective amount of 
the antisense oligonucleotides in a unit of weight or volume suitable for administration to a 
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patient. The term "pharmaceutically acceptable" means a non-toxic material that does not 
interfere with the effectiveness of the biological activity of the active ingredients. The term 
"physiologically acceptable" refers to a non-toxic material that is compatible with a biological 
system such as a cell, cell culture, tissue, or organism. The characteristics of the carrier will 
5 depend on the route of administration. Physiologically and pharmaceutically acceptable carriers 
include diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials which are well 
known in the art. 

As used herein, a "vector" may be any of a number of nucleic acids into which a desired 
sequence may be inserted by restriction and ligation for transport between different genetic 

10 environments or for expression in a host cell. Vectors are typically composed of DNA although 
RNA vectors are also available. Vectors include, but are not limited to, plasmids, phagemids and 
virus genomes. A cloning vector is one which is able to replicate in a host cell, and which is 
further characterized by one or more endonuclease restriction sites at which the vector may be 
cut in a determinable fashion and into which a desired DNA sequence may be ligated such that 

15 the new recombinant vector retains its ability to replicate in the host cell. In the case of 

plasmids, replication of the desired sequence may occur many times as the plasmid increases in 
copy number within the host bacterium or just a single time per host before the host reproduces 
by mitosis. In the case of phage, replication may occur actively during a lytic phase or passively 
during a lysogenic phase. An expression vector is one into which a desired DNA sequence may 

20 be inserted by restriction and ligation such that it is operably joined to regulatory sequences and 
may be expressed as an RNA transcript. Vectors may further contain one or more marker 
sequences suitable for use in the identification of cells which have or have not been transformed 
or transfected with the vector. Markers include, for example, genes encoding proteins which 
increase or decrease either resistance or sensitivity to antibiotics or other compounds, genes 

25 which encode enzymes whose activities are detectable by standard assays known in the art (e.g., 
fi-galactosidase or alkaline phosphatase), and genes which visibly affect the phenotype of 
transformed or transfected cells, hosts, colonies or plaques (e.g., green fluorescent protein). 
Preferred vectors are those capable of autonomous replication and expression of the structural 
gene products present in the DNA segments to which they are operably joined. 

30 As used herein, a coding sequence and regulatory sequences are said to be "operably" 

joined when they are covalently linked in such a way as to place the expression or transcription 
of the coding sequence under the influence or control of the regulatory sequences. If it is desired 
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that the coding sequences be translated into a functional protein, two DNA sequences are said to 
be operably joined if induction of a promoter in the 5' regulatory sequences results in the 
transcription of the coding sequence and if the nature of the linkage between the two DNA 
sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the 
5 ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere 
with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a 
promoter region would be operably joined to a coding sequence if the promoter region were 
capable of effecting transcription of that DNA sequence such that the resulting transcript might 
be translated into the desired protein or polypeptide. 

10 The precise nature of the regulatory sequences needed for gene expression may vary 

between species or cell types, but shall in general include, as necessary, 5' non-transcribed and 5' 
non-translated sequences involved with the initiation of transcription and translation respectively, 
such as a TATA box, capping sequence, CAAT sequence, and the like. Especially, such 5' non- 
transcribed regulatory sequences will include a promoter region which includes a promoter 

15 sequence for transcriptional control of the operably joined gene. Regulatory sequences may also 
include enhancer sequences or upstream activator sequences as desired. The vectors of the 
invention may optionally include 5' leader or signal sequences. The choice and design of an 
appropriate vector is within the ability and discretion of one of ordinary skill in the art. 

Expression vectors containing all the necessary elements for expression are commercially 

20 available and known to those skilled in the art. See, e.g., Sambrook et aL, Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, 1989. Cells are 
genetically engineered by the introduction into the cells of heterologous DNA (RNA) encoding 
Smad7 polypeptide or fragment or variant thereof. That heterologous DNA (RNA) is placed 
under operable control of transcriptional elements to permit the expression of the heterologous 

25 DNA in the host cell. 

Preferred systems for mRNA expression in mammalian cells are those such as pRc/CMV 
(available from Invitrogen, Carlsbad, CA) that contain a selectable marker such as a gene that 
confers G418 resistance (which facilitates the selection of stably transfected cell lines) and the 
human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, suitable for 

30 expression in primate or canine cell lines is the pCEP4 vector (Invitrogen), which contains an 
Epstein Barr virus (EBV) origin of replication, facilitating the maintenance of plasmid as a 
multicopy extrachromosomal element. Another expression vector is the pEF-BOS plasmid 
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containing the promoter of polypeptide Elongation Factor 1 a, which stimulates efficiently 
transcription in vitro. The plasmid is described by Mishizuma and Nagata (Nuc. Acids Res. 
18:5322, 1990), and its use in transfection experiments is disclosed by, for example, Demoulin 
(MoL Cell Biol 16:4710-4716, 1996). Still another preferred expression vector is an 
5 adenovirus, described by Stratford-Perricaudet, which is defective for El and E3 proteins {J. 
Clin. Invest. 90:626-630, 1992). The use of the adenovirus as an Adeno.Pl A recombinant is 
disclosed by Warmer et al., in intradermal injection in mice for immunization against PI A {Int. J. 
Cancer, 67:303-310, 1996). 

The invention also embraces so-called expression kits, which allow the artisan to prepare 

10 a desired expression vector or vectors. Such expression kits include at least separate portions of 
each of the previously discussed coding sequences. Other components may be added, as desired, 
as long as the previously mentioned sequences, which are required, are included. 

The invention also permits the construction of Smad7 gene "knock-outs" in cells and in 
animals, providing materials for studying certain aspects of TGF-P, activin and/or BMP signal 

15 transduction. 

The invention also provides isolated polypeptides, which include the polypeptides of 
SEQ ID NOs:4 and 6 and unique fragments of SEQ ID NOs:4 and 6 including fragments 
comprising amino acids 2-261 of SEQ ID NOs:4 and 6 and amino acids 204-426 of SEQ ID 
NOs:4 and 6. Such polypeptides are useful, for example, alone or as fusion proteins to generate 

20 antibodies, as a components of an immunoassay. 

A unique fragment of an Smad7 polypeptide, in general, has the features and 
characteristics of unique fragments as discussed above in connection with nucleic acids. As will 
be recognized by those skilled in the art, the size of the unique fragment will depend upon factors 
such as whether the fragment constitutes a portion of a conserved protein domain. Thus, some 

25 regions of amino acids 2-261 of SEQ ID NOs:4 and 6 and amino acids 204-426 of SEQ ID 
NOs:4 and 6 will require longer segments to be unique while others will require only short 
segments, typically between 5 and 12 amino acids (e.g. 5, 6, 7, 8, 9, 10, 11 and 12 amino acids 
long). Virtually any segment of amino acids 2-261 of SEQ ID NOs:4 and 6 and amino acids 
204-426 of SEQ ID NOs:4 and 6, that is 10 or more amino acids in length will be unique. 

30 Unique fragments of a polypeptide preferably are those fragments which retain a distinct 

functional capability of the polypeptide. Functional capabilities which can be retained in a 
unique fragment of a polypeptide include interaction with antibodies, interaction with other 
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polypeptides (such as T0R-I) or fragments thereof, selective binding of nucleic acids or proteins, 
and enzymatic activity. For example, as exemplified herein, N-terminal and C-terminal Smad7 
fragments such as those which includes amino acids 2-261 or 204-426 of SEQ ID NO:4 or SEQ 
ID NO: 6 can be used as a functional equivalent of full length Smad7 in the methods of the 
5 invention, including e.g., inhibition of TGF-p signal transduction. Other Smad polypeptide 
fragments, e.g., other N-terminal or C-terminal fragments, can be selected according to their 
functional properties. For example, one of ordinary skill in the art can prepare Smad7 fragments 
recombinantly and test those fragments according to the methods expemplified below, such as 
binding to a TGFp superfamily receptor, or inhibition of pathway specific Smad polypeptide 

10 phosphorylation. Those skilled in the art also are well versed in methods for selecting unique 
amino acid sequences, typically on the basis of the ability of the unique fragment to selectively 
distinguish the sequence of interest from non-family members. A comparison of the sequence of 
the fragment to those on known data bases typically is all that is necessary. 

The invention embraces variants of the Smad7 polypeptides described above. As used 

15 herein, a "variant" of a Smad7 polypeptide is a polypeptide which contains one or more 

modifications to the primary amino acid sequence of a Smad7 polypeptide. Modifications which 
create a Smad7 variant can be made to a Smad7 polypeptide 1) to reduce or eliminate an activity 
of a Smad7 polypeptide, such as binding to TpR-I; 2) to enhance a property of a Smad7 
polypeptide, such as protein stability in an expression system or the stability of protein-protein 

20 binding; or 3) to provide a novel activity or property to a Smad7 polypeptide, such as addition of 
an antigenic epitope or addition of a detectable moiety. Modifications to a Smad7 polypeptide 
are typically made to the nucleic acid which encodes the Smad7 polypeptide, and can include 
deletions, point mutations, truncations, amino acid substitutions and additions of amino acids or 
non-amino acid moieties. Alternatively, modifications can be made directly to the polypeptide, 

25 such as by cleavage, addition of a linker molecule, addition of a detectable moiety, such as 
biotin, addition of a fatty acid, and the like. Modifications also embrace fusion proteins 
comprising all or part of the Smad7 amino acid sequence. 

In general, variants include Smad7 polypeptides which are modified specifically to alter a 
feature of the polypeptide unrelated to its physiological activity. For example, cysteine residues 

30 can be substituted or deleted to prevent unwanted disulfide linkages. Similarly, certain amino 
acids can be changed to enhance expression of a Smad7 polypeptide by eliminating proteolysis 
by proteases in an expression system (e.g., dibasic amino acid residues in yeast expression 
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systems in which KEX2 protease activity is present). 

Mutations of a nucleic acid which encode a Smad7 polypeptide preferably preserve the 
amino acid reading frame of the coding sequence, and preferably do not create regions in the 
nucleic acid which are likely to hybridize to form secondary structures, such a hairpins or loops, 
5 which can be deleterious to expression of the variant polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to determine which 
mutation provides a variant polypeptide with the desired properties. Further mutations can be 

10 made to variants (or to non- variant Smad7 polypeptides) which are silent as to the amino acid 
sequence of the polypeptide, but which provide preferred codons for translation in a particular 
host. The preferred codons for translation of a nucleic acid in, e.g., E. coli, are well known to 
those of ordinary skill in the art. Still other mutations can be made to the noncoding sequences 
of a Smad7 gene or cDNA clone to enhance expression of the polypeptide. The activity of 

15 variants of Smad7 polypeptides can be tested by cloning the gene encoding the variant Smad7 
polypeptide into a bacterial or mammalian expression vector, introducing the vector into an 
appropriate host cell, expressing the variant Smad7 polypeptide, and testing for a functional 
capability of the Smad7 polypeptides as disclosed herein. For example, the variant Smad7 
polypeptide can be tested for inhibition of TpR-I, activin and/or BMP receptor signaling activity 

20 as disclosed in the Examples, or for inhibition of Smadl, Smad2, Smad3 and/or Smad5 

phosphorylation as is also disclosed herein. Preparation of other variant polypeptides may favor 
testing of other activities, as will be known to one of ordinary skill in the art. 

The skilled artisan will also realize that conservative amino acid substitutions may be 
made in Smad7 polypeptides to provide functionally equivalent variants of the foregoing 

25 polypeptides, i.e, the variants retain the functional capabilities of the Smad7 polypeptides. As 
used herein, a "conservative amino acid substitution" refers to an amino acid substitution which 
does not alter the relative charge or size characteristics of the protein in which the amino acid 
substitution is made. Variants can be prepared according to methods for altering polypeptide 
sequence known to one of ordinary skill in the art such as are found in references which compile 

30 such methods, e.g. Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second 
Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, or Current 
Protocols in Molecular Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. 



WO 98/53068 PCT7US98/10305 

-23- _ 

Exemplary functionally equivalent variants of the Smad7 polypeptides include conservative 
amino acid substitutions of SEQ ID NOs:4 or 6. Conservative substitutions of amino acids 
include substitutions made amongst amino acids within the following groups: (a) M 5 I, L, V; (b) 
F, Y, W; (c) K, R, H; (d) A 5 G; (e) S, T; (f) Q, N; and (g) E, D. 
5 Conservative amino-acid substitutions in the amino acid sequence of Smad7 polypeptides 

to produce functionally equivalent variants of Smad7 polypeptides typically are made by 
alteration of a nucleic acid encoding a Smad7 polypeptide (SEQ ID NOs:3 and 5). Such 
substitutions can be made by a variety of methods known to one of ordinary skill in the art. For 
example, amino acid substitutions may be made by PCR-directed mutation, site-directed 

10 mutagenesis according to the method of Kunkel (Kunkel, Proc. Nat. Acad. Set U.S.A. 82: 488- 
492, 1985), or by chemical synthesis of a gene encoding a Smad7 polypeptide. Where amino 
acid substitutions are made to a small unique fragment of a Smad7 polypeptide, such as a TpR-I 
binding site peptide, the substitutions can be made by directly synthesizing the peptide. The 
activity of functionally equivalent fragments of Smad7 polypeptides can be tested by cloning the 

15 gene encoding the altered Smad7 polypeptide into a bacterial or mammalian expression vector, 
introducing the vector into an appropriate host cell, expressing the altered Smad7 polypeptide, 
and testing for a functional capability of the Smad7 polypeptides as disclosed herein. Peptides 
which are chemically synthesized can be tested directly for function, e.g., for binding to TpR-I, 
ActR-IB and/or BMPR-IB. 

20 The invention as described herein has a number of uses, some of which are described 

elsewhere herein. First, the invention permits isolation of the Smad7 protein molecules (SEQ ID 
NOs:4 and 6). A variety of methodologies well-known to the skilled practitioner can be utilized 
to obtain isolated Smad7 molecules. The polypeptide may be purified from cells which naturally 
produce the polypeptide by chromatographic means or immunological recognition. 

25 Alternatively, an expression vector may be introduced into cells to cause production of the 
polypeptide. In another method, mRNA transcripts may be microinjected or otherwise 
introduced into cells to cause production of the encoded polypeptide. Translation of mRNA in 
cell-free extracts such as the reticulocyte lysate system also may be used to produce polypeptide. 
Those skilled in the art also can readily follow known methods for isolating Smad7 polypeptides. 

30 These include, but are not limited to, immunochromatography, HPLC, size-exclusion 
chromatography, ion-exchange chromatography and immune-affinity chromatography. 

The isolation of the Smad7 gene also makes it possible for the artisan to diagnose a 
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disorder characterized by expression of Smad7. These methods involve determining expression 
of the Smad7 gene, and/or Smad7 polypeptides derived therefrom. In the former situation, such 
determinations can be carried out via any standard nucleic acid determination assay, including 
the polymerase chain reaction as exemplified in the examples below, or assaying with labeled 
5 hybridization probes. 

The invention also makes it possible isolate proteins such as TpR-I, ActR-IB and 
BMPR-IB by the binding of such proteins to Smad7 as disclosed herein. The identification of 
this binding also permits one of skill in the art to block the binding of Smad7 to other proteins, 
such as TpR-I, as well as blocking the binding of other Smads, such as Smad2 or Smad3 to TpR- 

10 I receptors. Binding of the proteins can be effected by introducing into a biological system in 
which the proteins bind (e.g., a cell) a polypeptide including a Smad7 TpR-I binding site in an 
amount sufficient to block the binding. The identification of a TpR-I domain binding site in 
Smad7 also enables one of skill in the art to prepare modified proteins, using standard 
recombinant DNA techniques, which can bind to proteins such as TpR-I, ActR-IB and 

15 BMPR-IB. For example, when one desires to target a certain protein to a TGF-P receptor 

complex, one can prepare a fusion polypeptide of the protein and the Smad7 TpR-I binding site. 
Additional uses are described further herein. 

The invention further provides methods for reducing or increasing TGF-p family signal 
transduction in a cell. Such methods are useful in vitro for altering the TGF-p signal 

20 transduction, for example, in testing compounds for potential to block aberrant TGF-p signal 
transduction or increase deficient TGF-P signal transduction. In vivo, such methods are useful 
for modulating growth, e.g., to treat cancer and fibrosis. Increasing TGF-P signal transduction in 
a cell by, e.g., introducing a dominant negative Smad7 polypeptide or Smad7 antisense 
oligonucleotides in the cell, can be used to provide a model system for testing the effects of 

25 putative inhibitors of TGF-p signal transduction. Such methods also are useful in the treatment 
of conditions which result from excessive or deficient TGF-p signal transduction. TGF-p signal 
transduction can be measured by a variety of ways known to one of ordinary skill in the art, such 
as the reporter systems described in the Examples. Various modulators of Smad7 activity can be 
screened for effects on TGF-p signal transduction using the methods disclosed herein. The 

30 skilled artisan can first determine the modulation of a Smad7 activity, such as TGF-p signaling 
activity, and then apply such a modulator to a target cell or subject and assess the effect on the 
target cell or subject. For example, in screeing for modulators of Smad7 useful in the treatment 
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of cancer, cells in culture can be contacted with Smad7 modulators and the increase or decrease 
of growth or focus formation of the cells can be determined according to standard procedures. 
Smad7 activity modulators can be assessed for their effects on other TGF-p signal transduction 
downstream effects by similar methods in many cell types. The foregoing also applies to 
5 signalling via activin and BMP complexes. 

The invention also provides, in certain embodiments, "dominant negative" polypeptides 
derived from SEQ ID NOs:4 and/or 6. A dominant negative polypeptide is an inactive variant of 
a protein, which, by interacting with the cellular machinery, displaces an active protein from its 
interaction with the cellular machinery or competes with the active protein, thereby reducing the 

10 effect of the active protein. For example, a dominant negative receptor which binds a ligand but 
does not transmit a signal in response to binding of the ligand can reduce the biological effect of 
expression of the ligand. Likewise, a dominant negative catalytically-inactive kinase which 
interacts normally with target proteins but does not phosphorylate the target proteins can reduce 
phosphorylation of the target proteins in response to a cellular signal. Similarly, a dominant 

15 negative transcription factor which binds to a promoter site in the control region of a gene but 
does not increase gene transcription can reduce the effect of a normal transcription factor by 
occupying promoter binding sites without increasing transcription. 

The end result of the expression of a dominant negative polypeptide in a cell is a 
reduction in function of active proteins. One of ordinary skill in the art can assess the potential 

20 for a dominant negative variant of a protein, and using standard mutagenesis techniques to create 
one or more dominant negative variant polypeptides. For example, given the teachings contained 
herein of a Smad7 polypeptide, one of ordinary skill in the art can modify the sequence of the 
Smad7 polypeptide by site-specific mutagenesis, scanning mutagenesis, partial gene deletion or 
truncation, and the like. See, e.g., U.S. Patent No. 5,580,723 and Sambrook et al., Molecular 

25 Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, 1989. 
The skilled artisan then can test the population of mutagenized polypeptides for diminution in a 
selected activity (e.g., Smad7 reduction of TGF-p signalling activity) and/or for retention of such 
an activity. Other similar methods for creating and testing dominant negative variants of a 
protein will be apparent to one of ordinary skill in the art. 

30 Dominant negative Smad7 proteins include variants in which a portion of the TpR-I, 

activin receptor or BMP receptor binding site has been mutated or deleted to reduce or eliminate 
Smad7 interaction with the TGF-p, activin, or BMP receptor complex respectively. Other 



WO 98/53068 PCT/US98/10305 

- 26 - _■ 

examples include Smad7 variants in which the ability to inhibit phosphorylation of Smad2 and/or 
Smad3 is reduced. One of ordinary skill in the art can readily prepare and test Smad7 variants 
bearing mutations or deletions in the C-terminal domain (e.g., in the MH2 domain) or in the N- 
terminal domain (e.g., in the glycine/glutamic acid residue rich region). 
5 The invention also involves agents such as polypeptides which bind to Smad7 

polypeptides and to complexes of Smad7 polypeptides and binding partners such as T0R-I, 
ActR-IB and BMPR-IB. Such binding agents can be used, for example, in screening assays to 
detect the presence or absence of Smad7 polypeptides and complexes of Smad7 polypeptides and 
their binding partners and in purification protocols to isolate Smad7 polypeptides and complexes 

10 of Smad7 polypeptides and their binding partners. Such agents also can be used to inhibit the 
native activity of the Smad7 polypeptides or their binding partners, for example, by binding to 
such polypeptides, or their binding partners or both. 

The invention, therefore, embraces peptide binding agents which, for example, can be 
antibodies or fragments of antibodies having the ability to selectively bind to Smad7 

15 polypeptides. Antibodies include polyclonal and monoclonal antibodies, prepared according to 
conventional methodology. 

Significantly, as is well-known in the art, only a small portion of an antibody molecule, 
the paratope, is involved in the binding of the antibody to its epitope (see, in general, Clark, 
W.R. (1986) The Experimental Foundations of Modern Immunology Wiley & Sons, Inc., New 

20 York; Roitt, I. (1991) Essential Immunology . 7th Ed., Blackwell Scientific Publications, 

Oxford). The pFc' and Fc regions, for example, are effectors of the complement cascade but are 
not involved in antigen binding. An antibody from which the pFc' region has been enzymatically 
cleaved, or which has been produced without the pFc 1 region, designated an F(ab') 2 fragment, 
retains both of the antigen binding sites of an intact antibody. Similarly, an antibody from which 

25 the Fc region has been enzymatically cleaved, or which has been produced without the Fc region, 
designated an Fab fragment, retains one of the antigen binding sites of an intact antibody 
molecule. Proceeding further, Fab fragments consist of a covalently bound antibody light chain 
and a portion of the antibody heavy chain denoted Fd. The Fd fragments are the major 
determinant of antibody specificity (a single Fd fragment may be associated with up to ten 

30 different light chains without altering antibody specificity) and Fd fragments retain epitope- 
binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there are 
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complementarity determining regions (CDRs), which directly interact with the epitope of the 
antigen, and framework regions (FRs), which maintain the tertiary structure of the paratope (see, 
in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and the light chain of 
IgG immunoglobulins, there are four framework regions (FR1 through FR4) separated 
5 respectively by three complementarity determining regions (CDR1 through CDR3). The CDRs, 
and in particular the CDR3 regions, and more particularly the heavy chain CDR3, are largely 
responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian antibody 
may be replaced with similar regions of conspecific or heterospecific antibodies while retaining 

10 the epitopic specificity of the original antibody. This is most clearly manifested in the 

development and use of "humanized" antibodies in which non-human CDRs are covalently 
joined to human FR and/or Fc/pFc' regions to produce a functional antibody. Thus, for example, 
PCT International Publication Number WO 92/04381 teaches the production and use of 
humanized murine RS V antibodies in which at least a portion of the murine FR regions have 

15 been replaced by FR regions of human origin. Such antibodies, including fragments of intact 
antibodies with antigen-binding ability, are often referred to as "chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab') 2 , Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or FR 
and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous 

20 human or non-human sequences; chimeric F(ab') 2 fragment antibodies in which the FR and/or 
CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous human 
or non-human sequences; chimeric Fab fragment antibodies in which the FR and/or CDR1 and/or 
CDR2 and/or light chain CDR3 regions have been replaced by homologous human or non- 
human sequences; and chimeric Fd fragment antibodies in which the FR and/or CDR1 and/or 

25 CDR2 regions have been replaced by homologous human or non-human sequences. The present 
invention also includes so-called single chain antibodies. 

Thus, the invention involves polypeptides of numerous size and type that bind 
specifically to Smad7 polypeptides, and complexes of both Smad7 polypeptides and their 
binding partners. These polypeptides may be derived also from sources other than antibody 

30 technology. For example, such polypeptide binding agents can be provided by degenerate 
peptide libraries which can be readily prepared in solution, in immobilized form or as phage 
display libraries. Combinatorial libraries also can be synthesized of peptides containing one or 
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more amino acids. Libraries further can be synthesized of peptoids and non-peptide synthetic 
moieties. 

Phage display can be particularly effective in identifying binding peptides useful 
according to the invention. Briefly, one prepares a phage library (using e.g. ml 3, fd, or lambda 
5 phage), displaying inserts from 4 to about 80 amino acid residues using conventional procedures. 
The inserts may represent, for example, a completely degenerate or biased array. One then can 
select phage-bearing inserts which bind to the Smad7 polypeptide. This process can be repeated 
through several cycles of reselection of phage that bind to the Smad7 polypeptide. Repeated 
rounds lead to enrichment of phage bearing particular sequences. DNA sequence analysis can be 

10 conducted to identify the sequences of the expressed polypeptides. The minimal linear portion of 
the sequence that binds to the Smad7 polypeptide can be determined. One can repeat the 
procedure using a biased library containing inserts containing part or all of the minimal linear 
portion plus one or more additional degenerate residues upstream or downstream thereof. Yeast 
two-hybrid screening methods also may be used to identify polypeptides that bind to the Smad7 

15 polypeptides. Thus, the Smad7 polypeptides of the invention, or a fragment thereof, can be used 
to screen peptide libraries, including phage display libraries, to identify and select peptide 
binding partners of the Smad7 polypeptides of the invention. Such molecules can be used, as 
described, for screening assays, for purification protocols, for interfering directly with the 
functioning of Smad7 and for other purposes that will be apparent to those of ordinary skill in the 

20 art. 

A Smad7 polypeptide, or a fragment thereof, also can be used to isolate their native 
binding partners, including, e.g., the TGF-p, activin, or BMP receptor complexes. Isolation of 
such binding partners may be performed according to well-known methods. For example, 
isolated Smad7 polypeptides can be attached to a substrate (e.g., chromatographic media, such as 

25 polystyrene beads, or a filter), and then a solution suspected of containing the TGF-p receptor 
complex may be applied to the substrate. If a TGF-P receptor complex which can interact with 
Smad7 polypeptides is present in the solution, then it will bind to the substrate-bound Smad7 
polypeptide. The TGF-p receptor complex then may be isolated. Other proteins which are 
binding partners for Smad7, such as other Smads, activin receptor complexes, and BMP receptor 

30 complexes may be isolated by similar methods without undue experimentation. 

It will also be recognized that the invention embraces the use of the Smad7 cDNA 
sequences in expression vectors, as well as to transfect host cells and cell lines, be these 
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prokaryotic (e.g., E. coli), or eukaryotic (e.g., CHO cells, COS cells, yeast expression systems 
and recombinant baculovirus expression in insect cells). Especially useful are mammalian cells 
such as human, mouse, hamster, pig, goat, primate, etc. They may be of a wide variety of tissue 
types, and include primary cells and cell lines. Specific examples include keratinocytes, 
5 peripheral blood leukocytes, bone marrow stem cells and embryonic stem cells. The expression 
vectors require that the pertinent sequence, i.e., those nucleic acids described supra, be operably 
linked to a promoter. 

The invention also includes transgenic non-human animals. As used herein, "transgenic 
non-human animals" includes non-human animals having one or more exogenous nucleic acid 

10 molecules incorporated in germ line cells and/or somatic cells. Thus the transgenic animal 
include "knockout" animals having a homozygous or heterozygous gene disruption by 
homologous recombination, animals having episomal or chromosomally incorporated expression 
vectors, etc. Knockout animals can be prepared by homologous recombination using embryonic 
stem cells as is well known in the art. The recombination can be facilitated by the cre/lox system 

15 or other recombinase systems known to one of ordinary skill in the art. In certain embodiments, 
the recombinase system itself is expressed conditionally, for example, in certain tissues or cell 
types, at certain embryonic or post-embryonic developmental stages, inducibly by the addition of 
a compound which increases or decreases expression, and the like. In general, the conditional 
expression vectors used in such systems use a variety of promoters which confer the desired gene 

20 expression pattern (e.g., temporal or spatial). Conditional promoters also can be operably linked 
to Smad7 nucleic acid molecules to increase expression of Smad7 in a regulated or conditional 
manner. Trans-acting negative regulators of Smad7 activity or expression also can be operably 
linked to a conditional promoter as described above. Such trans-acting regulators include 
antisense Smad7 nucleic acids molecules, nucleic acid molecules which encode dominant 

25 negative Smad7 molecules, ribozyme molecules specific for Smad7 nucleic acids, and the like. 
The transgenic non-human animals are useful in experiments directed toward testing biochemical 
or physiological effects of diagnostics or therapeutics for conditions characterized by increased 
or decreased Smad7 expression. Other uses will be apparent to one of ordinary skill in the art. 
The invention also contemplates gene therapy. The procedure for performing ex vivo 

30 gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history of 
that patent, all of which are publicly available documents. In general, it involves introduction in 
vitro of a functional copy of a gene into a cell(s) of a subject which contains a defective copy of 
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the gene, and returning the genetically engineered cell(s) to the subject. The functional copy of 
the gene is under operable control of regulatory elements which permit expression of the gene in 
the genetically engineered cell(s). Numerous transfection and transduction techniques as well as 
appropriate expression vectors are well known to those of ordinary skill in the art, some of which 
5 are described in PCT application WO95/00654. In vivo gene therapy using vectors such as 
adenovirus, retroviruses, herpes virus, and targeted liposomes also is contemplated according to 
the invention. 

The invention further provides efficient methods of identifying pharmacological agents or 
lead compounds for agents active at the level of a Smad7 or Smad7 fragment modulatable 

10 cellular function. In particular, such functions include TGF-P, activin and/or BMP signal 

transduction and formation of a TGF-p, activin and/or BMP receptor-Smad7 protein complex. 
Generally, the screening methods involve assaying for compounds which interfere with a Smad7 
activity such as TGF-p receptor- Smad7 binding, etc, although compounds which enhance Smad7 
activity also can be assayed using the screening methods. Such methods are adaptable to 

15 automated, high throughput screening of compounds. The target therapeutic indications for 
pharmacological agents detected by the screening methods are limited only in that the target 
cellular function be subject to modulation by alteration of the formation of a complex comprising 
a Smad7 polypeptide or fragment thereof and one or more natural Smad7 intracellular binding 
targets, such as TGF-P receptor. Target indications include cellular processes modulated by 

20 TGF-p, activin and/or BMP signal transduction following receptor-ligand binding. 

A wide variety of assays for pharmacological agents are provided, including, labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays, cell- 
based assays such as two- or three-hybrid screens, expression assays, etc. For example, three- 
hybrid screens are used to rapidly examine the effect of transfected nucleic acids on the 

25 intracellular binding of Smad7 or Smad7 fragments to specific intracellular targets. The 

transfected nucleic acids can encode, for example, combinatorial peptide libraries or antisense 
molecules. Convenient reagents for such assays, e.g., GAL4 fusion proteins, are known in the 
art. An exemplary cell-based assay involves transfecting a cell with a nucleic acid encoding a 
Smad7 polypeptide fused to a GAL4 DNA binding domain and a nucleic acid encoding a TGF-p 

30 receptor domain which interacts with Smad7 fused to a transcription activation domain such as 
VP 16. The cell also contains a reporter gene operably linked to a gene expression regulatory 
region, such as one or more GAL4 binding sites. Activation of reporter gene transcription occurs 
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when the Smad7 and TGF-p receptor fusion polypeptides bind such that the GAL4 DNA binding 
domain and the VP 16 transcriptional activation domain are brought into proximity to enable 
transcription of the reporter gene. Agents which modulate a Smad7 polypeptide mediated cell 
function are then detected through a change in the expression of reporter gene. Methods for 
5 determining changes in the expression of a reporter gene are known in the art. 

Smad7 fragments used in the methods, when not produced by a transfected nucleic acid 
are added to an assay mixture as an isolated polypeptide. Smad7 polypeptides preferably are 
produced recombinantly, although such polypeptides may be isolated from biological extracts. 
Recombinantly produced Smad7 polypeptides include chimeric proteins comprising a fusion of a 

10 Smad7 protein with another polypeptide, e.g., a polypeptide capable of providing or enhancing 
protein-protein binding, sequence specific nucleic acid binding (such as GAL4), enhancing 
stability of the Smad7 polypeptide under assay conditions, or providing a detectable moiety, such 
as green fluorescent protein or Flag epitope as provided in the examples below. 

The assay mixture is comprised of a natural intracellular Smad7 binding target such as a 

15 TGF-p receptor or fragment thereof capable of interacting with Smad7. While natural Smad7 
binding targets may be used, it is frequently preferred to use portions (e.g., peptides or nucleic 
acid fragments) or analogs (i.e., agents which mimic the Smad7 binding properties of the natural 
binding target for purposes of the assay) of the Smad7 binding target so long as the portion or 
analog provides binding affinity and avidity to the Smad7 fragment measurable in the assay. 

20 The assay mixture also comprises a candidate pharmacological agent. Typically, a 

plurality of assay mixtures are run in parallel with different agent concentrations to obtain a 
different response to the various concentrations. Typically, one of these concentrations serves as 
a negative control, i.e., at zero concentration of agent or at a concentration of agent below the 
limits of assay detection. Candidate agents encompass numerous chemical classes, although 

25 typically they are organic compounds. Preferably, the candidate pharmacological agents are 
small organic compounds, i.e., those having a molecular weight of more than 50 yet less than 
about 2500, preferably less than about 1000 and, more preferably, less than about 500. 
Candidate agents comprise functional chemical groups necessary for structural interactions with 
polypeptides and/or nucleic acids, and typically include at least an amine, carbonyl, hydroxyl or 

30 carboxyl group, preferably at least two of the functional chemical groups and more preferably at 
least three of the functional chemical groups. The candidate agents can comprise cyclic carbon 
or heterocyclic structure and/or aromatic or polyaromatic structures substituted with one or more 
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of the above-identified functional groups. Candidate agents also can be biomolecules such as 
peptides, saccharides, fatty acids, sterols, isoprenoids, purines, pyrimidines, derivatives or 
structural analogs of the above, or combinations thereof and the like. Where the agent is a 
nucleic acid, the agent typically is a DNA or RNA molecule, although modified nucleic acids as 
5 defined herein are also contemplated. 

Candidate agents are obtained from a wide variety of sources including libraries of 
synthetic or natural compounds. For example, numerous means are available for random and 
directed synthesis of a wide variety of organic compounds and biomolecules, including 
expression of randomized oligonucleotides, synthetic organic combinatorial libraries, phage 

10 display libraries of random peptides, and the like. Alternatively, libraries of natural compounds 
in the form of bacterial, fungal, plant and animal extracts are available or readily produced. 
Additionally, natural and synthetically produced libraries and compounds can be readily be 
modified through conventional chemical, physical, and biochemical means. Further, known 
pharmacological agents may be subjected to directed or random chemical modifications such as 

15 acylation, alkylation, esterification, amidification, etc. to produce structural analogs of the 
agents. 

A variety of other reagents also can be included in the mixture. These include reagents 
such as salts, buffers, neutral proteins (e.g., albumin), detergents, etc. which may be used to 
facilitate optimal protein-protein and/or protein-nucleic acid binding. Such a reagent may also 

20 reduce non-specific or background interactions of the reaction components. Other reagents that 
improve the efficiency of the assay such as protease, inhibitors, nuclease inhibitors, antimicrobial 
agents, and the like may also be used. 

The mixture of the foregoing assay materials is incubated under conditions whereby, but 
for the presence of the candidate pharmacological agent, the Smad7 polypeptide specifically 

25 binds the cellular binding target, a portion thereof or analog thereof. The order of addition of 
components, incubation temperature, time of incubation, and other perimeters of the assay may 
be readily determined. Such experimentation merely involves optimization of the assay 
parameters, not the fundamental composition of the assay. Incubation temperatures typically are 
between 4°C and 40°C. Incubation times preferably are minimized to facilitate rapid, high 

30 throughput screening, and typically are between 0.1 and 10 hours. 

After incubation, the presence or absence of specific binding between the Smad7 
polypeptide and one or more binding targets is detected by any convenient method available to 
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the user. For cell free binding type assays, a separation step is often used to separate bound from 
unbound components. The separation step may be accomplished in a variety of ways. 
Conveniently, at least one of the components is immobilized on a solid substrate, from which the 
unbound components may be easily separated. The solid substrate can be made of a wide variety 
5 of materials and in a wide variety of shapes, e.g., microtiter plate, microbead, dipstick, resin 
particle, etc. The substrate preferably is chosen to maximum signal to noise ratios, primarily to 
minimize background binding, as well as for ease of separation and cost. 

Separation may be effected for example, by removing a bead or dipstick from a reservoir, 
emptying or diluting a reservoir such as a microtiter plate well, rinsing a bead, particle, 

10 chromotograpic column or filter with a wash solution or solvent. The separation step preferably 
includes multiple rinses or washes. For example, when the solid substrate is a microtiter plate, 
the wells may be washed several times with a washing solution, which typically includes those 
components of the incubation mixture that do not participate in specific bindings such as salts, 
buffer, detergent, non-specific protein, etc. Where the solid substrate is a magnetic bead, the 

15 beads may be washed one or more times with a washing solution and isolated using a magnet. 

Detection may be effected in any convenient way for cell-based assays such as two- or 
three-hybrid screens. The transcript resulting from a reporter gene transcription assay of Smad7 
polypeptide interacting with a target molecule typically encodes a directly or indirectly 
detectable product, e.g., P-galactosidase activity, luciferase activity, and the like. For cell free 

20 binding assays, one of the components usually comprises, or is coupled to, a detectable label. A 
wide variety of labels can be used, such as those that provide direct detection (e.g., radioactivity, 
luminescence, optical or electron density, etc), or indirect detection (e.g., epitope tag such as the 
FLAG epitope, enzyme tag such as horseseradish peroxidase, etc.). The label may be bound to a 
Smad7 binding partner, or incorporated into the structure of the binding partner. 

25 A variety of methods may be used to detect the label, depending on the nature of the label 

and other assay components. For example, the label may be detected while bound to the solid 
substrate or subsequent to separation from the solid substrate. Labels may be directly detected 
through optical or electron density, radioactive emissions, nonradiative energy transfers, etc. or 
indirectly detected with antibody conjugates, strepavidin-biotin conjugates, etc. Methods for 

30 detecting the labels are well known in the art. 

The invention provides Smad7-specific binding agents, methods of identifying and 
making such agents, and their use in diagnosis, therapy and pharmaceutical development. For 



WO 98/53068 PCTYUS98/10305 

-34- _ 

example, Smad7-specific pharmacological agents are useful in a variety of diagnostic and 
therapeutic applications, especially where disease or disease prognosis is associated with 
improper utilization of a pathway involving Smad7, e.g., TGF-p induced phosphorylation of 
Smad2 or Smad3, TGF-p receptor-Smad7 complex formation, activin or BMP signalling, etc. 
5 Novel Smad7-specific binding agents include Smad7-specific antibodies and other natural 
intracellular binding agents identified with assays such as two hybrid screens, and non-natural 
intracellular binding agents identified in screens of chemical libraries and the like. 

In general, the specificity of Smad7 binding to a binding agent is shown by binding 
equilibrium constants. Targets which are capable of selectively binding a Smad7 polypeptide 

10 preferably have binding equilibrium constants of at least about 10 7 M" 1 , more preferably at least 
about 10 8 M' 1 , and most preferably at least about 10 9 M" 1 . The wide variety of cell based and cell 
free assays may be used to demonstrate Smad7-specific binding. Cell based assays include one, 
two and three hybrid screens, assays in which Smad7 -mediated transcription is inhibited or 
increased, etc. Cell free assays include Smad7-protein binding assays, immunoassays, etc. 

15 Other assays useful for screening agents which bind Smad7 polypeptides include fluorescence 
resonance energy transfer (FRET), and electrophoretic mobility shift analysis (EMS A). 

Various techniques may be employed for introducing nucleic acids of the invention into 
cells, depending on whether the nucleic acids are introduced in vitro or in vivo in a host. Such 
techniques include transfection of nucleic acid-CaP0 4 precipitates, transfection of nucleic acids 

20 associated with DEAE, transfection with a retrovirus including the nucleic acid of interest, 

liposome mediated transfection, and the like. For certain uses, it is preferred to target the nucleic 
acid to particular cells. In such instances, a vehicle used for delivering a nucleic acid of the 
invention into a cell (e.g., a retrovirus, or other virus; a liposome) can have a targeting molecule 
attached thereto. For example, a molecule such as an antibody specific for a surface membrane 

25 protein on the target cell or a ligand for a receptor on the target cell can be bound to or 
incorporated within the nucleic acid delivery vehicle. For example, where liposomes are 
employed to deliver the nucleic acids of the invention, proteins which bind to a surface 
membrane protein associated with endocytosis may be incorporated into the liposome 
formulation for targeting and/or to facilitate uptake. Such proteins include capsid proteins or 

30 fragments thereof tropic for a particular cell type, antibodies for proteins which undergo 

internalization in cycling, proteins that target intracellular localization and enhance intracellular 
half life, and the like. Polymeric delivery systems also have been used successfully to deliver 



WO 98/53068 PCT/US98/10305 

-35- _ 

nucleic acids into cells, as is known by those skilled in the art. Such systems even permit oral 
delivery of nucleic acids. 

When administered, the therapeutic compositions of the present invention are 
administered in pharmaceutically acceptable preparations. Such preparations may routinely 
5 contain pharmaceutically acceptable concentrations of salt, buffering agents, preservatives, 

compatible carriers, supplementary immune potentiating agents such as adjuvants and cytokines 
and optionally other therapeutic agents. 

The therapeutics of the invention can be administered by any conventional route, 
including injection or by gradual infusion over time. The administration may, for example, be 

10 oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. 
When antibodies are used therapeutically, a preferred route of administration is by pulmonary 
aerosol. Techniques for preparing aerosol delivery systems containing antibodies are well 
known to those of skill in the art. Generally, such systems should utilize components which will 
not significantly impair the biological properties of the antibodies, such as the paratope binding 

15 capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington's Pharmaceutical 

Sciences . 18th edition, 1990, pp 1694-1712; incorporated by reference). Those of skill in the art 
can readily determine the various parameters and conditions for producing antibody aerosols 
without resort to undue experimentation. When using antisense preparations of the invention, 
slow intravenous administration is preferred. 

20 The compositions of the invention are administered in effective amounts. An "effective 

amount" is that amount of a composition that alone, or together with further doses, produces the 
desired response, e.g. alters favorably the signal transduction resulting from binding of TGF-p, 
activin, BMP and/or Vgl (Vgr-1) to specific receptors. In the case of treating a particular 
disease, such as cancer, the desired response is inhibiting the progression of the disease. This 

25 may involve only slowing the progression of the disease temporarily, although more preferably, 
it involves halting the progression of the disease permanently. This can be monitored by routine 
methods or can be monitored according to diagnostic methods of the invention discussed herein. 

Such amounts will depend, of course, on the particular condition being treated, the 
severity of the condition, the individual patient parameters including age, physical condition, size 

30 and weight, the duration of the treatment, the nature of concurrent therapy (if any), the specific 
route of administration and like factors within the knowledge and expertise of the health 
practioner. These factors are well known to those of ordinary skill in the art and can be 
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addressed with no more than routine experimentation. It is generally preferred that a maximum 
dose of the individual components or combinations thereof be used, that is, the highest safe dose 
according to sound medical judgment. It will be understood by those of ordinary skill in the art, 
however, that a patient may insist upon a lower dose or tolerable dose for medical reasons, 
5 psychological reasons or for virtually any other reasons. 

The pharmaceutical compositions used in the foregoing methods preferably are sterile 
and contain an effective amount of Smad7 or nucleic acid encoding Smad7 for producing the 
desired response in a unit of weight or volume suitable for administration to a patient. The 
response can, for example, be measured by determining the signal transduction enhanced or 

10 inhibited by the Smad7 composition via a reporter system as described herein, by measuring 
downstream effects such as gene expression, or by measuring the physiological effects of the 
Smad7 composition, such as regression of a tumor or decrease of disease symptoms. Likewise, 
the effects of antisense Smad6 and Smad7 can be readily determined by measuring expression of 
the individual genes in cells to which an antisense composition is added. Other assays will be 

15 known to one of ordinary skill in the art and can be employed for measuring the level of the 
response. 

The doses of Smad7 polypeptide or nucleic acid administered to a subject can be chosen 
in accordance with different parameters, in particular in accordance with the mode of 
administration used and the state of the subject. Other factors include the desired period of 

20 treatment. In the event that a response in a subject is insufficient at the initial doses applied, 
higher doses (or effectively higher doses by a different, more localized delivery route) may be 
employed to the extent that patient tolerance permits. 

In general, doses of Smad7 are formulated and administered in doses between 1 ng and 
1 mg, and preferably between 10 ng and 100 /*g, according to any standard procedure in the art. 

25 Where nucleic acids encoding Smad7 of variants thereof are employed, doses of between 1 ng 
and 0.1 mg generally will be formulated and administered according to standard procedures. 
Other protocols for the administration of Smad7 compositions will be known to one of ordinary 
skill in the art, in which the dose amount, schedule of injections, sites of injections, mode of 
administration (e.g., intra-tumoral) and the like vary from the foregoing. Administration of 

30 Smad7 compositions to mammals other than humans, e.g. for testing purposes or veterinary 
therapeutic purposes, is carried out under substantially the same conditions as described above. 
When administered, the pharmaceutical preparations of the invention are applied in 
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pharmaceutically-acceptable amounts and in pharmaceutically-acceptable compositions. The 
term "pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredients. Such preparations may 
routinely contain salts, buffering agents, preservatives, compatible carriers, and optionally 
5 other therapeutic agents. When used in medicine, the salts should be pharmaceutically 
acceptable, but non-pharmaceutically acceptable salts may conveniently be used to prepare 
pharmaceutically-acceptable salts thereof and are not excluded from the scope of the invention. 
Such pharmacologically and pharmaceutically-acceptable salts include, but are not limited to, 
those prepared from the following acids: hydrochloric, hydrobromic, sulfuric, nitric, 

10 phosphoric, maleic, acetic, salicylic, citric, formic, malonic, succinic, and the like. Also, 

pharmaceutically-acceptable salts can be prepared as alkaline metal or alkaline earth salts, such 
as sodium, potassium or calcium salts. 

Smad7 may be combined, if desired, with a pharmaceutically-acceptable carrier. The 
term "pharmaceutically-acceptable carrier" as used herein means one or more compatible solid 

15 or liquid fillers, diluents or encapsulating substances which are suitable for administration into 
a human. The term "carrier" denotes an organic or inorganic ingredient, natural or synthetic, 
with which the active ingredient is combined to facilitate the application. The components of 
the pharmaceutical compositions also are capable of being co-mingled with the molecules of 
the present invention, and with each other, in a manner such that there is no interaction which 

20 would substantially impair the desired pharmaceutical efficacy. 

The pharmaceutical compositions may contain suitable buffering agents, including: 
acetic acid in a salt; citric acid in a salt; boric acid in a salt; and phosphoric acid in a salt. 

The pharmaceutical compositions also may contain, optionally, suitable preservatives, 
such as: benzalkonium chloride; chlorobutanol; parabens and thimerosal. 

25 The pharmaceutical compositions may conveniently be presented in unit dosage form 

and may be prepared by any of the methods well-known in the art of pharmacy. All methods 
include the step of bringing the active agent into association with a carrier which constitutes 
one or more accessory ingredients. In general, the compositions are prepared by uniformly 
and intimately bringing the active compound into association with a liquid carrier, a finely 

30 divided solid carrier, or both, and then, if necessary, shaping the product. 

Compositions suitable for oral administration may be presented as discrete units, such 
as capsules, tablets, lozenges, each containing a predetermined amount of the active 
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compound. Other compositions include suspensions in aqueous liquids or non-aqueous liquids 
such as a syrup, elixir or an emulsion. 

Compositions suitable for parenteral administration conveniently comprise a sterile 
aqueous or non-aqueous preparation of Smad7 polypeptides or nucleic acids, which is 
5 preferably isotonic with the blood of the recipient. This preparation may be formulated 
according to known methods using suitable dispersing or wetting agents and suspending 
agents. The sterile injectable preparation also may be a sterile injectable solution or 
suspension in a non-toxic parenterally-acceptable diluent or solvent, for example, as a solution 
in 1,3-butane diol. Among the acceptable vehicles and solvents that may be employed are 

10 water, Ringer's solution, and isotonic sodium chloride solution. In addition, sterile, fixed oils 
are conventionally employed as a solvent or suspending medium. For this purpose any bland 
fixed oil may be employed including synthetic mono-or di-glycerides. In addition, fatty acids 
such as oleic acid may be used in the preparation of injectables. Carrier formulation suitable 
for oral, subcutaneous, intravenous, intramuscular, etc. administrations can be found in 

15 Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA. 

In another aspect of the invention, Smad7 polypeptides or nucleic acid are used in the 
manufacture of a medicament for modulating a TGF-p ? activin, BMP or Vgl response. The 
medicament can be placed in a vial and be incorporated into a kit to be used for increasing a 
subject's response to one or more of the above TGF-0 family members. In certain embodiments, 

20 other medicaments which modulate the same responses or which favorably affect the Smad7 
compositions can also be included in the same kit. The kits can include instructions or other 
printed material on how to administer the Smad7 compositions and any other components of the 
kit. 

25 Examples 
Methods 

Isolation of mSmad7 and hSmad7 cDNA and Northern blot analysis. cDNA encoding the 
complete mSmad7 was made by fusing mouse EST cDNA (AA061644) with a partial cDNA 
30 isolated from a mouse placenta library. The cDNA for hSmad7 was isolated by screening a 
human brain cDNA library. cDNAs were sequenced on both strands using an ABI3 1 0 Genetic 
Analyzer. Isolation of total RNA, Northern blot analysis was performed as described before 



WO 98/53068 PCT/US98/10305 

-39- _ 

(Afrakhte, M. et al., Int. J. Cancer 68, 802-809 (1996) using QUIKHYB® hybridization solution 
from Stratagene (La Jolla, CA). 

Expression Plasmids. Expression constructs for TpR-I, TpR-II, BMPR-IA, BMPR-IB, ActR-I, 
5 ActR-IB, Smadl, Smad2, Smad3 and Smad5 have been described previously (Nakao et al., 
EMBO J. 16:5353-5362, 1997). Other Smad or receptor plasmids have been previously 
described or were prepared according to standard protocols. F-Smad7 and F-Smad8 were made 
by a PCR-directed approach and subcloning into pcDNA-Flag. N-terminal and C-terminal 
Smad7 expression vectors were prepared using PCR amplification with specific primers and 

10 Smad7 cDNA as template. Expression constructs for the C-terminal domain of F-Smad7 (7C; 
amino acids 204-426), F-Smad7C with C-tail deletion (7CA; amino acids 204-207), Smad7 with 
deletion of C-tail (7 A; amino acids 1-407), the N-terminal domain of Smad7 (7N; amino acids 1- 
203), mouse F-Smad6 "long version" (F-Smad6L)(Imamura et aL, 1997) and human F-Smad6 
"short version" (F-Smad6S) (Topper et al., 1997), were made by a PCR-directed approach and 

15 were subcloned into pcDNA3-Flag. The resulting expression plasmids expressed N-terminal and 
C-terminal Smad domains tagged at their N-termini with the Flag epitope. Alternatively, the 
Smads and Smad fragments were subcloned into pMEP4 for inducible expression of the proteins. 
Anti-sense expression constructs for Smad molecules including mSmad7 and hSmad3 were 
prepared by cloning the complete Smad coding regions in reverse direction in pcDNA3 

20 expression vector (Invitrogen, Carlsbad, CA). 

Cell Assays. Transient transfection, metabolic labeling, immunoprecipitation, 
[ 32 P]orthophosphate labeling of cells, and SDS-PAGE were performed as described previously 
(Nakao etal., 1997). 

25 

Iodination of Ligand and Affinity Cross-linking. Iodination of TGF-pl and affinity cross- 
linking followed by immunoprecipitation were performed as previously described (Nakao et al., 
1997). TGF-pl and BMP-7 were iodinated using the chloramine-T method according to Frolik 
et al (J. Biol Chem. 259:10995-1 1000, 1984). Cross-linking was performed as previously 
30 described (Nakao et al., 1997). In some cases, incubation with 125 I-TGF-pi was performed at 
room temperature for 2 h (Souchelnytskyi et al., J. Biol Chem. 272:27678-27685, 1997) and the 
transfected cells lines were stimulated for 24 h with 1 00/^M/ml zinc chloride to induce the 
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expression of Smad7. Complexes of Smads and affinity-labeled receptors were 
immunoprecipitated with antiserum directed against epitope tags in Smads. To determine 
expression levels of receptors, aliquots of cell lysates were immunoprecipitated with antisera 
against type I receptors. Expression of Smads was determined by Western blotting on aliquots of 
5 cell lysates. 

Transcriptional response assay. MvlLu wild-type and R mutant cells were transiently 
transfected with p3TPLux using DEAE-dextran transfection method, as described before (Nakao 
et aL, 1997). HaCat cells were transiently transfected with p21 reporter plasmid (Datto et al., J. 

10 Biol Chem. 270: 28623-28628, 1995) using Transfectam reagent from Promega (Madison, WI). 
In each experiment, equal total amounts of DNA were transfected. In other experiments, MvlLu 
cells were transfected with p3TPLux and anti-sense expression constructs using Lipofectin; after 
incubation overnight in Optimum medium, the medium was changed to serum-containing 
medium. After another 24 h or incubation the cells were stimulated with TGF-pl and luciferase 

15 activity was measured. Luciferase activity was measured as previously described (Nakao et al., 
1997). The values were normalized for transfection efficiency using the p-gal reporter gene 
under transcriptional control of CMV promoter (pCMV5 vector). Results shown are 
representative of at least three or four independent experiments. 

Transient transfections of COS cells were performed using the DEAE-dextran protocol. 

20 Stable transfection of MvlLu cells with pMEP4 expression vector (Invitrogen) was done using 
calcium phosphate precipitation method, as previously described (Souchelnytskyi et al., 1997); 
selection was performed with 420 units/ml hygromycin. Induction with zinc chloride was done 
with lOOyuM zinc chloride for 20 h, unless indicated otherwise. Metabolic labeling of cells, 
[ 32 P]orthophosphate labeling, and immunoprecipitation, and SDS-PAGE were performed as 

25 earlier described (Souchelnyskyi et al., 1997 or Nakao et al., 1997). 

Xenopus embryo culture and manipulation. Xenopus eggs were obtained and embryos 
microinjected and cultured as described (Moon and Christian, Technique 1: 76-89, 1989). For 
animal cap assays, 200 pg of RNA encoding activin-P or a bone morphogenetic protein-Vgl 
30 chimera (Dale et al., EMBO J. 12: 4471-4480, 1993) was injected either alone, or together with 
400 pg of Smad7 RNA. For Smad7/Smad8 experiments, RNA was synthesized by in vitro 
transcription (Moon and Christian, 1989) of pCS2+Smad8 and pCS2+Smad7 (Nakayama et al., 
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Development 125:857-867, 1998). The latter construct was generated by subcloning the coding 
region of F-Smad7 into pCS2+ (Turner and Weintraub, Genes Dev. 8:1434-1447, 1994). 
Embryonic stages are according to Nieuwkoop and Faber {Normal table ofXenopus laevis, 
Garland Publishing, Amsterdam, North Holland, 1967). Immunostaining of whole embryos 
5 (12/101 antibody was obtained from the Developmental Studies Hybridoma Bank under contract 
N01-HD-2-3144 from the NICHD, Tor 70 was obtained from R. Harland), whole mount in situ 
hybridization, RT-PCR analysis of RNA extracted from cultured animal caps, and quantitation of 
brachyury expression relative to that of EF-la were performed as described (Lagna et al., Nature 
383: 832-836, 1996; Cui et al., Dev. Biol 180: 22-34, 1996; Moon and Christian, 1989). 

10 

Cell Lines. COS cells and MvlLu mink lung epithelial cells and HaCat cells (human 
keratinocytes) were obtained from American Type Culture Collection. Cells were cultured in 
Dulbecco's modified Eagle's medium (Life Technologies, Inc., Gaithersburg, MD) 
supplemented with 10% fetal bovine serum (FBS) and antibiotics (100 units of penicillin and 50 

15 jag of streptomycin per ml). All the 10 different human lung carcinomas used [small cell lung 
carcinoma (SCLC), U-1285, U-1690, H-69 and H-92; non-SCLC, U-1752; squamous cell 
carcinoma (SQC); U-1810; large-cell carcinoma (LCC), H-157, H-661; adenocarcinoma (ADC) 
H-23 and H-125 (Histological typing of lung tumors, according to WHO, Geneva)] were grown 
in RPMI 1640 medium with 10% FBS and antibiotics. (Heldin et al., Br. J. Cancer, 68: 708- 

20 711, 1993). 

RNA Isolation and Northern Blot Analysis. Cells were kept in 0.5% FBS 12-24 h prior to 
stimulation with various factors followed by RNA extraction. Lung carcinoma cells used for 
RNA extraction were grown in the presence of 10% FCS. Isolation of total RNA and Northern 

25 blotting was performed essentially as described (Afrakhte et al., Int. J. Cancer, 68/802-809, 
1996). The RNA extracted from the lung carcinomas cell lines was poly(A) + -enriched. 
Hybridizations were performed using QuickHyb buffer from Stratagene. The intactness and total 
amount of RNA (12 ^ig or total RNA or 5 \ig of poly(A) + RNA per lane) were checked by 
staining the gel with ethidium bromide and by hybridization with a glyceraldehyde 3 -phosphate 

30 dehydrogenase (GAPDH) probe. The cDNA probes used in the hybridizations were a 1 .8 kb 

Eco/Xho I human Smad7 fragment, a 2.0 kb EcoRI fragment of the human Smad6, a 3 kb Eco RI 
fragment of the human PAI-1, and a 1.5 kb mouse JunB fragment. For the GAPDH 
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hybridizations the entire human GAPDH cDNA plasmid was labeled. 
Antibodies 

Antibodies against Smad2, termed DQQ, have been previously described (Nakao et al., 
5 1997). Anti-Flag antibody was purchased from Kodak (New Haven). Specific antisera for 
Smad6, termed ESP and SRQ were raised against ESPPPPYSRLSPRDEYKPLD (SEQ ID 
NO: 1 1) and SRQFITSCPCWLEILNPR (SEQ ID NO: 12), respectively. Specific antisera raised 
against Smad7, termed KER and KAV, were raised against the synthetic peptides 
KERQLELLLQAVESRGGTRTA (SEQ ID NO: 13) and KAVRGAKGHHHPHPP (SEQ ID 
10 NO: 14), respectively. The peptides were coupled to keyhole limpet haEmocyanin (Calbiochem- 
Behring) with glutaraldehyde, mixed with Freund's adjuvant, and used to immunize rabbits 
according to standard procedures. 

Growth Inhibition assay 

15 MvlLu cells were seeded at 1X1 0 4 cells/well in 24-well plates. Prior to addition of 

TGF-pl the MvlLu cells were simultaneously treated (or not) with 100 jjM zinc chloride and 
indicated concentrations of TGF-pl for 20h to induce Smad7 expression. Before harvesting, the 
cells were pulsed with o.2 /*Ci of [ 3 H] thymidine (6.7 Ci/mmol, Amersham, UK) for 2h. The 
cells were fixed with ice-cold 5% trichloroacetic acid (TCA) for more than 20 min, washed twice 

20 with 5% TCA and once with water. Solubilization of the cells was done with 400//1 of 0. 1 M 
NaOH for 20 min at room temperature. The 3 H-radioactivity incorporated into DNA was 
determined by liquid scintillation counting. 

Cells were grown in LAB TEK chambers (Nunc, Naperville, IL), and incubated with 
25 DMEM containing 0.3% FCS in the absence or presence of TGF-pl for 2h. The slides were 

washed once with phosphate buffered saline (PBS), fixed for 10 min with 4% paraformaldehyde, 
followed by three washes with PBS, subsequently permeabilized with 0.1% Triton X-100 in PBS 
for 5 min, and again washed three times with PBS. Slides were blocked by 10% goat serum for 
lh in room temperature, then incubated with 10% goat serum with anti-FLAG antibody 
30 (20/zg/ml) for 15 h at 4°C. The slides were subsequently washed 3 times, incubated with 
TRITC-conjugated goat anti-mouse IgG antibody (diluted 1 :40) and washed again 4 times. 
Nuclei were stained with DAPI (lyug/ml) for 10 min in room temperature followed by three 
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washes. In order to visualize the fluorescence, a Zeiss microscope, or laser confocal microscope 
was used. For cell counting a square lattice mounted in one of the eyepieces was used. In the 
COS cell experiments 200 cells were counted in 5 different field and the nuclear localization was 
checked by DAPI staining. 

5 

Example 1: Cloning and characterization of Smad7 

A database search for mammalian sequences related to Smads revealed the existence of 
two mouse expression sequence tags (ESTs), i.e. AA022262 and AA061644, with sequence 
similarity to N- and C-terminal Smad domains, respectively. Polymerase chain reaction (PCR) 

10 using kidney cDNA as a template with a PCR primer pair derived from both ESTs resulted in a 
specific product. A cDNA encoding the complete mouse protein was made by fusing mouse 
EST cDNA with a cDNA isolated from a mouse placenta library, and was termed mouse Smad7 
(mSmad7). Human Smad7 cDNA (hSmad7) was isolated from a human brain cDNA library. 
The cDNA sequences predict that mSmad7 and hSmad7 have 426 amino acid residues with 98% 

15 identity (Fig. la). 

All differences between hSmad7 and mSmad7 are found in the N-terminal domain. The 
borders of the Mad homology 2 (MH2) domain are indicated by arrows. The nucleotide 
sequences are deposited in European Molecular Biology Laboratory/GenBank data library 
(mouse and human Smad7 accession numbers are AFO 152260 and AF0 15261, respectively). 

20 Smad7 is most related to Smad6 (Imamura et al., Nature, 389:622-626, 1997) with 36% and 56% 
sequence identities in the N-terminal domain and the C-terminal MH2 domain, respectively. The 
Smad7 N-terminal domain, which contains a glycine/glutamic acid residue-rich region, shows 
only weak similarity (approximately 15%) to the MH1 domains found in Smadl through Smad5. 
RNA blot analysis of various tissues with a probe from the region coding for the N- 

25 terminal domain of Smad7 revealed one major transcript of approximately 4.4kb (Fig. lb). 
Among the tissues analyzed, the highest expression of Smad7 was found in the lung. 

Example 2: Smad 7 modulates TGF-P superfamily signal transduction 

To investigate whether Smad7 modulates the responsiveness to TGF-p, the TGF-p- 
30 inducible luciferase p3TPLux reporter construct, which contains the TGF-[}-inducible PAI-1 

promoter, was transfected into MvlLu mink epithelial cells in the absence or presence of Smad7 
cDNA. Smad7 was found to exert a dose-dependent inhibition TGF-P-induce luciferase activity 
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(Fig. 2a). Moreover, the induction of p3TPLux luciferase by a constitutively active variant of 
TpR-I, when transfected in R-mutant cells, was also inhibited by cotransfection with Smad7, as 
was the response by a constitutively active variant of the structurally related type I receptor for 
activin (ActR-IB) (Fig. 2B). Transfection of Smad2 did not affect TGF-pl -induced p3TPLux 
5 luciferase response in MvlLu cells (Fig. 2a). This inhibitory effect was specific as Smad7 did 
not inhibit the phorbol 12-myristate 13 -acetate (PMA)/epidermal growth factor-induced 
p3TPLux luciferase response. In addition, the forskolin-mediated transcriptional induction using 
a cAMP-responsive-element-containing reporter construct was not affected by Smad7. These 
results indicate that Smad7 is a potent negative regulator of both TpR-I- and ActR-IB -induced 

10 p3TPLux response. 

To examine the effect of Smad7 expression on TGF-p-mediated growth inhibition, a 
luciferase transcriptional reporter construct containing the p21 CDK inhibitor promoter (p21Lux) 
that is induced by TGF-P in human keratinocytes (HaCat) (Imamura, 1997) was used. Smad7 
was found to antagonize TGF-P 1 -mediated p21Lux response in HaCat cells dose-dependently 

15 (Fig. 2c). Increased expression of Smad2 by transfection slightly stimulated this response, 
whereas Smadl transfection had no effect (Fig. 2c). Smad7 and N-terminally Flag-tagged 
Smad7 gave essentially the same antagonistic results on TGF-p signaling, suggesting that N- 
terminal tagging does not interfere with functional properties of Smad7. Thus, Smad7 inhibits 
TGF-P-induced pathways leading to extracellular matrix production as well as growth inhibition. 

20 To determine whether Smad7 can inhibit transduction of TGF-p/activin-like signals in 

vivo, patterning defects caused by overexpression of Smad7 in Xenopus embryos were analyzed. 
When the endogenous activin signaling pathway is inactivated in early Xenopus embryos, by 
introduction of dominant negative forms of either an activin receptor (Hemmati-Brivanlou et al., 
Nature 359: 606-614, 1992; Chang et al., Development 124: 827-837, 1997) or of Smad4 (Lagna 

25 et al., Nature 383: 832-836, 1996), mesoderm fails to form. Similarly, microinjection of RNA 
encoding Smad7 into both blastomeres of two-cell embryos inhibited mesoderm formation (Fig. 
2d-g). Five hundred picograms of synthetic RNA encoding either Smad7 or myc-epitope tag 
(MT) was injected into each blastomere of two-cell embryos as shown schematically (Fig. 2d) 
and embryos were cultured to the tailbud stage. Embryos injected with MT RNA developed 

30 normally (top embryo in each panel) whereas embryos made to misexpress Smad7 (bottom 

embryo in each panel) failed to form head or tail structures (Fig. 2e) and showed a complete or 
partial loss of muscle (Fig. 2f) and notochord (Fig. 2g). Arrows indicate immunoreactive muscle 
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(Fig. 2f) and notochord (Fig. 2g) in MT-injected embryos. Specifically, head and tail structures 
were absent or severely deficient in 80% of injected embryos (n=168) (Fig. 2e) as were 
mesodermal derivatives such as muscle (n=60) (Fig. 2f) and notochord (n=48) (Fig. 2g). 

To further test the ability of Smad7 to block mesoderm formation in vivo, embryos were 
5 analyzed for expression of brachyury, a gene that is expressed throughout the presumptive 
mesoderm during gastrulation (Smith et al., Cell 67:79-87,1991) (Fig. 2h). Five hundred 
picograms of synthetic RNA encoding MT (Fig. 2h) or Smad7 (Fig. 2i) was injected into one 
blastomere of two-cell embryos near the equator and expression of brachyury analyzed by whole 
mount in situ hybridization. MT-injected embryos show the typical ring of brachyury expression 

10 (Fig. 2h) while brachyury transcripts are not detected on one side of Smad7-injected embryos 
(Fig. 2i, arrow). As shown in Fig. 2i, injection of Smad7 RNA into the equatorial region of one 
blastomere of two-cell embryos prevented brachyury expression on one side of the embryo. 

Two members of the TGF-p family, activin and Vgl, are candidates for being 
endogenous mesoderm- inducing molecules (Kessler et al., Science 266: 596-604, 1994). A 

15 Xenopus animal cap assay (Kessler, 1994) was usedto directly test the ability of Smad7 to block 
signaling downstream of these ligands. While both activin and Vgl induced expression of 
brachyury in ectodermal explants (animal caps), coexpression of Smad7 inhibited brachyury 
induction of 60%, thereby demonstrating that Smad7 can block activin and Vgl -mediated 
mesoderm induction. 

20 To obtain insight into the mechanism by which Smad7 exerts its negative role in 

signaling by TGF-p family members, it was tested whether Smad7, like Smad2 and Smad3 
(Zhang et al., Nature 383:168-172, 1996; Macias-Silva et al., Cell 87:1215-1224, 1996; Nakao et 
al., 1997), can associate with the TGF-p receptor complex. COS cells transfected with 
N-terminally Flag-tagged Smad7 in combination with TpR-II (wild-type, WT, or kinase- 

25 deficient mutant, KD) and TpR-I (wild-type or kinase deficient mutant) were affinity labeled 
with 125 I-TGF-pi, and cell lysates were subjected to immunoprecipitation with Flag antiserum 
against the Flag-epitope in Smad7. Immunoprecipitates were analyzed by SDS-PAGE and FujiX 
Bio-Imager. Expression of receptors and F-Smad7 after transfection was determined by 
immunoprecipitation with VPN antibody against TpR-I or DRL antibody against TpR-II on 

30 aliquots of cell lysates, and by immunoprecipitation with Flag antibody on lysates [ 35 S] 

methionine/cysteine labeled transfected cells. It was found that Smad7 interacted very efficiently 
with the TGF-p receptor complex (Fig. 3). Smad7 interacted with wild-type TpR-I as well as 
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kinase inactive TpR-I in complex with wild-type TpR-II. In contrast, Smad2 and Smad3 interact 
stably only with complex of kinase-deficient TpR-I and wild-type TpR-II (Macias-Silva et al., 
1996; Nakao et al., 1997). No interaction was observed between Smad7 and a heteromeric 
complex of kinase-deficient TpR-II and TpR-I, or with TpR-II alone (Fig. 3). Thus, 
5 transphosphorylation of TpR-I by TpR-II kinase is required for association of Smad7 with 
TpR-I. 

Example 3: Phosphorylation of Smad7 

Phosphorylation of Smad2 and Smad3, but not Smad4, is induced after TGF-p 

10 stimulation (Eppert et al., Cell 86: 543-552, 1996; Zhang et al., 1996; Macias-Silva et al., 1996; 
Nakao et al., 1997; Lagna et al., 1996. It was investigated whether Smad7 is phosphorylated 
upon association with TGF-P receptors. COS cells were transfected with Smad7 (or for 
comparison with Smad2) in the absence or presence of receptors, labeled with 
[ 32 P]orthophosphate, and treated with TGF-P 1 . Cell ly sates were subjected to 

15 immunoprecipitation with antisera against Smads or epitope tags, and the level of Smad 

phosphorylation was determined. Expression of type I receptors and Smads was analyzed by 
immunoprecipitation on aliquots of cell lysates, which had been labeled with [ 35 S]- 
methionine/cysteine. The 32 P- or 35 S-radioactivity associated with Smad2 in (Fig. 4b) or Smad3 
in (Fig. 4d) were quantitated by using a FujiX Bio-Imager and the 32 P/ 35 S-ratio calculated (Fig. 

20 4c and e, respectively). Representative results of three independent experiments are shown. As 
expected, Smad2 phosphorylation was increased upon coexpression with receptors, and addition 
of TGF-P 1 led to a further increase of Smad2 phosphorylation. However, no (or only very weak) 
phosphorylation of Smad7 was observed in cells transfected with constitutively active forms of 
TpR-I and ActR-IB or in cells transfected with wild-type TpR-I and TpR-II after stimulation 

25 with ligand (Fig. 4a). Thus, in spite of its direct association with TpR-I, Smad7 is not a direct 
substrate for the TpR-I kinase. Notably, Smad7 lacks the conserved SS(M/V)S C-tail sequence 
motif (Fig. la), which in the case of Smad2 (Macias-Silva et al., 1996) and Smadl (Kretzschmar 
et al., Genes Dev. 1 1 : 984-995, 1997) are phosphorylated by type I receptor kinase. 

30 Example 4: Smad7 reduces phosphorylation of Smad proteins 

Smad7 may exert its negative role in TGF-P signaling by interfering with activation of 
other Smads. Therefore it was examined whether Smad7 affects the phosphorylation of Smad2 
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and Smad3 using [ 32 P]orthophosphate-labeled COS cells transfected with TGF-p receptors and 
Smads. COS cells were transfected with F-Smad7 or Smad2 (Fig. 4a), Smad2 alone or together 
with Smad7 (Fig. 4b), and N-terminally Myc-tagged Smad3 (M-Smad3) alone or together with 
F-Smad7 (Fig. 4d) in the absence or presence of wild-type or constitutive active (c.a.) receptors, 
5 whereafter cells were labeled with [ 32 P]orthophosphate. C-terminally HA-tagged type I receptors 
were used. Interestingly, Smad7 abolished the TGF-p 1 -induced phosphorylation of Smad2 (Fig. 
4b,c) and Smad3 in dose-dependent manner (Figs. 4d,e). A Smad7 level twice as high as Smad3 
appeared sufficient to cause a significant decrease in TGF-P-induced receptor-mediated Smad3 
phosphorylation. As previously reported (Macias-Silva et aL, 1996; Nakao et al., 1996), it was 

10 found that Smad2 and Smad3 become phosphorylated when overexpressed in COS cells. For 
Smad2 this receptor-independent phosphorylation does not occur at the SSMS motif in the C- 
terminal tail (Macias-Silva et aL, 1996), and may possibly occur via non-receptor kinases. 
Smad7 appears not to block this receptor-independent phosphorylation. Since activation of 
Smad2 and Smad3 is essential for optimal TGF-p signaling (Zhang et al., 1996; Macias-Silva et 

15 aL, 1996; Nakao et aL, 1997; Lagna et aL, 1996; Kretzschmar et aL, 1997), the inhibition of their 
phosphorylation provides a mechanistic explanation for the antagonistic effect of Smad7. The 
association of Smad7 with TpR-I suggests that it may compete with Smad2 and Smad3 for 
receptor binding. In accordance with this notion, co-transfection of Smad2 or Smad3 with 
Smad7 reduced the antagonistic effect of Smad7 in a TGF-P 1 -induced p3TPLux assay. In 

20 addition, it was determined that Smad7 not only inhibits TGF-P and activin signaling, but also 
blocks BMP signaling. Using methodology described herein, it was determined that Smad7 
associated with BMPR-Is and inhibited phosphorylation of Smadl in cultured mammalian cells. 
In addition, injection of Smad7 RNA into ventral cells of Xenopus embryos phenocopies the 
effect of blocking the BMP signaling pathway, and led to formation of an incomplete secondary 

25 dorsal axis. 

Example 5: Regulation of Smad7 expression by TGF-p 

Regulation of Smad7 expression by signaling molecules may be used to effectively 
modulate TGF-p responses. Therefore, it was investigated whether the expression of Smad7, 
30 and for comparison Smad2, Smad3 and Smad4, were regulated by TGF-p 1 . Northern blots with 
20 ^g/lane total RNA from TGF-pi stimulated MvlLu cells, SW1736 human anaplastic thyroid 
carcinoma cells and HaCat cells probed with Smad7 revealed that Smad7 mRNA was rapidly 
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induced in response to lOng/ml TGF-pl stimulation (Fig. 5a,b). The amount of RNA loaded was 
checked by hybridization of filters to a glyceraldehyde-3 -phosphate dehydrogenase (GAPDH) 
cDNA probe. In MvlLu cells, Smad7 mRNA was induced 4-fold after 30 min of TGF-p 
stimulation. Expression of Smad2, Smad3 and Smad4 upon TGF-(il stimulation remained 
5 unchanged (Fig. 5a). Induction of Smad7 mRNA by TGF-pl was observed in the presence of 
20 jUg/ml cycloheximide (CHX), indicating that de novo protein synthesis is not required for 
TGF-pl -mediated induction of Smad7. In fact, Smad7 mRNA was superinduced in the presence 
of TGF-pl and 20 /xg/ml CHX added 30 min. prior to TGF-pl (Fig. 5b). This is likely caused by 
increase in mRNA stability or loss of transcriptional repressors by CHX. Treatment with CHX 

10 alone only slightly increased the basal Smad7 mRNA level. In contrast, 10 nM PMA, which like 
TGF-pi stimulates PAI-1 expression and inhibits the growth of MvlLu cells, did not induce 
Smad7 expression (Fig. 5c). In MvlLu cells lacking functional TpR-I (R mutant), TGF-pl had 
no effect on Smad7 mRNA expression. Taken together these data indicate that Smad7 is an 
immediate-early response gene for TGF-pi, and activated complexes of Smad2, Smad3 and 

15 Smad4 may directly and/or in combination with DNA binding cofactors act on the Smad7 
promoter. The intracellular antagonist Smad7 may thus act in a negative feedback loop to 
regulate the intensity or duration of the TGF-P signal. 

Example 6: Smad7 interacts with BMP receptor complexes and inhibits BMP-mediated 
20 signaling. 

To determine if Smad7 associates with the BMP receptor complex, COS cells were 
transfected with the BMP receptors BMPR-IA, BMPR-IB, or ActR-I (wild-type or kinase- 
deficient mutant) tagged at C-terminus with HA epitope, and BMPR-II (wild-type or kinase- 
deficient mutant), in the absence or presence of N-terminally Flag-tagged Smad7 (F-Smad7). 

25 The receptors were covalently affinity labeled with 125 I-BMP-7, and cell lysates were subjected to 
immunoprecipitation with Flag antibody to immunoprecipitate F-Smad7. Immunoprecipitates 
were analyzed by SDS-PAGE and FujiX Bio-Imager. Expression of receptors and F-Smad7 (not 
shown) after transfection was determined by immunoprecipitation with HA-antisera on aliquots 
of cell lysates, and by immunoprecipitation with Flag antibody on lysates from [ 35 S] 

30 methionine/cysteine labeled transfected cells. As shown in Figs. 6A-F, TGFsuperfamily 

receptors coimmunoprecipitated with F-Smad7. Therefore, Smad7 associated with BMPR-IA 
and BMPR-IB, and less efficiently with ActR-I receptors. As shown in Fig. 6B, no interaction 



WO 98/53068 PCT/US98/1 0305 

- 49 - _■ 

was observed between Smad7 and heteromeric complexes of a kinase-deficient type II receptor 
and a wild-type or a kinase-deficient type I receptor. Thus Smad7 only interacts with type I 
receptors which are phosphorylated by the type II receptor (i.e. activated). Among the tested 
type I receptors, Smad7 preferentially interacts with BMPR-IA, BMPR-IB, TpR-I and likely 
5 ActR-IB, and less efficiently with ActR-I. In addition, the effect of Smad7 on BMPR-IB- and 
ActR-I-mediated phosphorylation of Smadl or Smad5 was examined (Fig. 6 C-F). Fig. C shows 
that Smad 7 inhibits BMPR-IB- and ActR-I-mediated phosphorylation of Smadl. COS cells 
were transfected with F-Smadl alone or together with F-Smad7 in the presence of type I 
receptors (BMPR-IB or ActR-I) and BMPR-II in the absence or presence of BMP-7, whereafter 

10 cells were labeled with [ 32 P]orthophosphate and immunoprecipitated with Flag antisera. 

Expression of F-Smadl and F-Smad7 was determined by immunoblotting with Flag on aliquots 
of cell lysates. Fig 6D shows that the 32 P -radioactivity associated with Smadl in Fig. 6C as 
quantified by using a FujiX Bio-Imager and plotted. Fig. 6E shows that Smad7 inhibits BMPR- 
IB phosphorylation of Smad5. COS cells were transfected with F-Smad5 alone or together with 

15 F-Smad7, in the presence of BMPR-IB and BMPR-II; cells were incubated in the absence or 
presence of BMP-7, whereafter they were labeled with [ 32 P]orthophosphate and subjected to 
immujnoprecipitation with Flag antisera. Fig. 6F shows the 32 P-radioactivity associated with 
SmadS in Fig. 6E as quantified using a FujiX Bio-Imager and plotted. COS cells were 
transfected with cDNAs encoding BMP receptors and Smadl or SmadS in the absence or 

20 presence of Smad7. The phosphorylation level of Smadl or SmadS was examined by 

[ 32 l']orthophosphate labeling and immunoprecipitation from cell lysates using Smad antisera. 
Smad7 potently inhibited BMPR-IB-mediated phosphorylation of Smadl and SmadS in a dose 
dependent manner (Fig.6 C-F). In addition, Smad7 inhibited ActR-I-mediated phosphorylation 
of Smadl (Fig.6 C, D). The specificity of the interaction based on the phosphorylation of the 

25 type I receptor by type II receptor kinase strongly suggests physiological importance of the 
interaction. Phosphorylation may induce a conformational change that is required for Smad7- 
type I receptor interaction. 

To determine if Smad7 inhibits BMP-mediated phosphorylation of pathway specific 
Smads, COS cells were transfected with F-Smadl or F-Smad5, alone or together with F-Smad7 

30 in the absence or presence of constitutive active (c.a.) BMPR-IB (Fig. 7A), or alone or together 
with F-Smad7, type I receptors and BMPR-II in the absence or presence of BMP-7 (Fig. 7C), 
whereafter cells were labeled with [ 32 P]-orthophosphate. The amount of transfected Smad7 is 
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indicated in Fig. 7C (0.1 or 1.0 microgram). Cell lysates were subjected to immunoprecipitation 
with antisera against Smads, and the level of phosphorylation determined. Expression of Smads 
and BMPR-IB was analyzed by immunoprecipitation with antisera directed towards epitope tags 
on aliquots of cell lysates, which had been labeled with [ 35 S] methionine/cysteine. In Fig. 7B, 
5 the 32 P- or 35 S -radioactivity associated with Smadl were quantitated by using FujiX Bio-Imager 
and the 32 P/ 32 S-ratio calculated. Fig. 7 shows that Smad7 potently inhibited phosphorylation of 
by BMPR-IB-mediated Smadl or Smad5 phosphorylation in a dose-dependent manner. 
Consistent with the differential receptor binding, BMPR-IB-mediated Smad phosphorylation was 
more efficiently inhibited by Smad7 than was ActR-I -mediated Smad phosphorylation. 

10 

Example 7: Analysis of Smad7 N-terminal and C-terminal domain function 

To determine if the C-terminal domain of Smad7 associates with the TGF-P receptor 
complex, COS cells were transfected with wild-type N-terminally tagged mouse Smad7 (F- 
mSmad7) or a N-terminally Flag tagged mouse Smad7 C-terminal domain (amino acids 204-426 

15 of mSmad7) in combination with wildtype TGF-P receptors. The receptors were covalently 

affinity labeled with 125 I-TGF-pl. Immunoprecipitates were analyzed by SDS-PAGE and FujiX 
Biolmager. Expression of receptors and Smad7 after transfection was determined by 
immunoprecipitation with VPN antisera against TpR-I on aliquots of cell lysates, and by 
immunoprecipitation with Flag antibody on cell lysates from 35 S methionine/cysteine labeled 

20 transfected cells. As shown in Fig. 8A, the TGF-P receptor complex coprecipitates with Smad7 
C-terminal domain. Therefore, the Smad7 C-terminal domain associates with TGF-P receptors. 
Next, the C-terminal domain was tested in an assay of TGF-P-induced p3TPLux response, as 
described above. As shown in Fig. 8B, the C-terminal domain inhibits, albeit less efficiently as 
wild-type Smad7, the TGF-p-induced p3TPLux response. Fig. 8B, panel 2 shows that 

25 transfection of Smad7C blocks TGF-p-induced p3TPLux transcriptional response, but less 
efficiently than will-type Smad7. 

The inhibitory effect of N- and C-terminal Smad7 domains and Smad8 on TGFp 
superfamily receptor responses was tested. Smad8 and Smad7 deletion constructs were tested for 
effects on TpR-I-mediated phosphorylation of Smad2 or Smad3. COS cells were transfected 

30 with TpR-I and TpR-II and indicated Smads. Phosphorylation of Smads was determined by a 
32 P orthophosphate labeling assay. Fig. 8C shows that Smad8 and N- and C-terminal (MH2) 
domains of Smad7 each inhibited TpR-I-mediated phosphorylation of Smad2 or Smad3. The 
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effect of the C-terminal domain was achieved at lower expression levels than the N-terminal 
domain. Consistent with the ability of the C-domain of Smad7 to inhibit TpR-I -mediated 
phosphorylation of Smad2, this domain was sufficient to inhibit TGF-P-induced transcriptional 
activation, although it was less efficient at doing so than was wild-type Smad7 (Fig. 8B, panel 2). 

5 Deletion of the last 19 amino acids residues completely abrogated the ability of the Smad7 C- 
domain to inhibit TGF-P-induced transcriptional activation of the 3TP promoter. 

Fig. 8D shows the inhibitory effect of Smad7 N- and C-terminal domains and Smad8 (as 
compared to Smad7 wild-type) on TGF-P receptor mediated p3TPLux transcriptional response. 
MvlLu cells were transfected with p3 TP-Lux reporter construct with or without different 

10 quantities of inhibitory Smads. The induction of luciferase expression upon cell incubation with 
TGF-p was measured. Consistent with their differential phosphorylation inhibitory effects, 
Smad8 and the Smad7 domains inhibited the TGF-p mediated transcriptional response of the 
reporter gene. 

Fig. 8E shows that Smad7C inhibits TpR-I-mediated phosphorylation of Smad2, but less 
15 efficiently than wild-type Smad7. In the absence of inhibitory Smad, TGF-P 1 stimulated a 1 .6- 
fold increase in Smad2 phosphorylation level, which was completely blocked by cotransfection 
with 1/^g wild-type Smad7, whereas in the presence of lyug Smad7C a 1.2-fold increase in TGF- 
p-mediated Smad2 phosphorylation was observed. COS cells were transfected with F-Smad2 
alone or together with F-Smad7C or wild-type F-Smad7 in the presence of BMPR-IB and 
20 BMPR-II; cells were then incubated in the absence or presence of BMP-7, whereafter they were 
labeled with [ 32 P]orthophosphate and subjected to immunoprecipitation with Flag antisera. 
Expression of F-Smad2, F-Smad7 and F-Smad7C was determined by immunoblotting with Flag 
on aliquots of cell lysates. 

25 Example 8: Smad6 is Highly Conserved and Most Closely Related to Smad7 

A database search for human and mouse sequences related to Smad7 revealed the 
existence of an expressed sequence tag (EST; Genbank accession number N95582) 
corresponding to a novel human Smad. By screening of a human placenta cDNA library we 
obtained a 3.2 kb cDNA clone. The entire coding region of this clone was sequenced using ABI 
30 Prism 310 Genetic Analyzer, and sequence analysis was performed with DNASTAR. Sequence 
analysis revealed 92% amino acid similarity to mouse Smad6 (Imamura et al., Nature? 389. 622- 
626, 1997), and therefore this gene was termed human Smad6 (hSmad6) (Fig. 9A). Fig. 9 A 
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shows sequence comparison of human and mouse Smad6 and human Smad7. Identical residues 
are boxed. The borders of the Mad-homology MH2 domain are indicated by arrows. The 
Genbank accession number for human Smad6 is AF 043 640. The sequence identity between 
human and mouse Smad6 is lower compared to that observed with other Smads, e.g., human and 
5 mouse Smad7 are 98% identical (Hayashi et al., 1997; Nakao et al., 1997). Fig. 9B shows 
pairwise alignment relationship between hSmadl through hSmad7 and hSmad9. Among the 
human Smads identified to date, hSmad6 is most closely related to hSmad7 (41% overall 
sequence identity). Like the N-terminal domain of Smad7, the N-terminal domain of Smad6 
shows very weak similarity to MH1 domains from Smadl through Smad5. Notably hSmad6 

10 lacks the conserved SS(M/V)S motif in its carboxy-terminal tail, which in the case of pathway- 
restricted Smads is phosphorylated by appropriate type I receptor kinases. Recently Hata et al. 
reported a hSmad6 sequence (accession number AF035528) (Hata et al., Genes Dev., 12:186- 
197, 1998). Comparison of the two sequences reveals one nucleotide difference in the coding 
region resulting in one amino acid difference (codon 21 in AF043640 is predicted to be an 

1 5 aspartic acid residue). 

Example 9: Smad6 and Smad7 Expression in Lung Cancer Cell Lines 

As previously reported, the distribution of Smad6 mRNA in various human tissues 
revealed that Smad6 was broadly expressed in various human tissues. One Smad6 transcript of 

20 approximately 3 kb was detected (Imamura, T., et al. 1997). Interestingly, the expression 

profiles of Smad6 and Smad7 (Nakao et al., 1997) were very similar with the highest expression 
in the lung. Therefore, a panel of 10 different lung carcinoma cell lines (four SCLC and six non- 
SCLC) was examined for Smad6 and Smad7 mRNA expression (Fig. 9C). The highest Smad6 
mRNA expression was detected in the SCLC cell lines H-69 and H-82, and in the non-SCLC cell 

25 lines H-661 and H-23, whereas the SCLC cell line U-1690, H-69 and H-82, and the non-SCLC 
cell lines H-157 and H-125 expressed the highest levels of Smad7 mRNA. Thus, the two genes 
are differently expressed, and no correlation was observed between Smad6 and Smad7 
expression in these cells. 

30 Example 10: TGF-P Family Members Induce Smad6 and Smad7 mRNA in MvlLu and 
HaCat Cells 

Mink lung epithelial (MvlLu) and human keratinocyte (HaCat) cell lines are responsive 
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to TGF-pl, activin and BMP-7 (Yashamita et al. J. Cell Biol. 130:217-226, 1995). Smad7 
mRNA is rapidly included by TGF-pi (Nakao, A., 1997). Northern blot analysis of Smad6 and 
Smad7 expression was performed on RNA prepared from MvlLu and HaCat cells stimulated 
with TGF-pl (10 ng/ml), activin (50 ng/ml) and BMP-7 (500 ng/ml). Phosphor Imager 

5 quantitation revealed that (after 90 min of ligand stimulation and normalizing with GAPDH 
mRNA expression level) in MvlLu cells (Fig. 10A Panel 1), TGF-pl, activin and BMP-7 
induced Smad6 mRNA expression 6-, 2- and 3 -fold, respectively, and induced Smad7 mRNA 
expression 5-, 2-, and 3-fold, respectively, and in HaCat cells (Fig. 10A Panel 2), TGF-pl, 
activin and BMP-7 induced Smad6 mRNA expression 4-, 2- and 12-fold, respectively, and 

10 included Smad7 mRNA expression 6-, 3-, and 6-fold, respectively. Both genes were rapidly 
induced by all three ligands (Fig. 10A). Smad6 and Smad7 were induced with similar kinetics 
by TGF-P and activin, with a major peak of expression after 90 min of stimulation. A second 
peak of Smad6 and Smad7 mRNA expression was observed after 24 hr of stimulation MvlLu 
cells, in particular after TGF-pl stimulation (Fig. 10A). There was a difference in the BMP7- 

15 induced mRNA expression profiles for Smad6 and Smad7 in MvlLu cells: whereas Smad7 
mRNA expression peaked at 90 min and was reduced to low levels thereafter, Smad6 mRNA 
remained high after 90 min of stimulation (Fig. 10A). Treatment of cells with 0.5% FCS, prior 
to ligand treatment, was found to decrease the basal level of inhibitory Smad expression, making 
the mRNA inductions observed with ligands more pronounced. The expression of Smad6 and 

20 Smad7 mRNA after stimulation with ligands were similar in cultures of high and low cell 
density. 

As shown in the Examples above, Smad7 not only inhibits TGF-P and activin, but also 
BMP signaling. Smad7 associates with BMP type I receptors and inhibits BMPR-I-mediated 
signaling. Thus Smad7 may exert a negative feedback control on signaling by TGF-P 1, activin 

25 as well as BMP-7. 

Example 11: Smad6 is a Direct Target Gene for TGF-pi 

Previously it was demonstrated that the induction of Smad7 mRNA by TGF-pl was 
observed in the presence of cycloheximide (CHX), indicating that de novo protein synthesis is 
not required for this response. Induction by TGF-pl of Smad6 mRNA was also seen after 

30 addition of CHX (Fig. 10B). CHX (20 \ig/m\) was added 30 min before TGF-P 1. The induction 
of Smad6 mRNA was prolonged in the presence of both TGF-P 1 and CHX when compared to 
TGF-pl alone (Fig. 10B), probably as a result of CHX-induced loss of transcriptional repressors 
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or increase in Smad6 mRNA stability. Based upon these results, Smad6 and Smad7 transcription 
are likely to be directly regulated by pathway-restricted and common mediator Smads. 
Therefore, the effect of TGF-pl on MDA-MB-468 breast cancer cells that lack Smad4 (Lagna et 
al., Nature, 383:832-836, 1996) was examined (Fig. 10C). Smad6 mRNA expression was not 
5 affected by TGF-pl in these cells. Smad7 expression was below detection level in the absence 
or presence of TGF-pl, or possibly the Smad7 gene may be absent in these cells. Taken 
together, the results indicate that Smad6 and Smad7 are immediate-early response genes for 
TGF-P family members. 

10 Example 12: Transfection of Anti-sense Expression Construct Enhances Cellular 
Responsiveness to TGF-P 1 

To investigate whether the expression level of inhibitory Smads modulate the 
responsiveness to TGF-P, TGF-P 1 -induced transcriptional response was measured in MvlLu 
cells transfected with an anti-sense Smad7 cDNA expression construct. MvlLu cells that are 

15 grown in the presence of serum have relatively high constitutive Smad7 expression (Fig. 9C), 
and Smad7 is rapidly induced by TGF-P 1 (Fig. 10A). Transfection of anti-sense Smad7 
expression construct (mSmad7r), but not empty vector or anti-sense Smad3 expression construct 
(hSmad3r), increased TGF-P 1 -mediated p3TPLux transcriptional response in MvlLu cells. 
TGF-P 1 (10 ng/ml) was added 4 h prior to lysis of cells to measure luciferase activity. Data are 

20 presented as mean +/- SEM. Transfection of MvlLu cells with 20 \xg of anti-sense Smad7 
expression plasmid increased the TGF-pl response, measured as activation of the PAI-1 
promoter containing reporter, (p3TPlux) after a 4 h stimulation. The increased TGF-p response 
in the Smad7 anti-sense transfected cells was 2-4 fold compared to control cells transfected with 
empty expression vector (Fig. 1 1); the response was dependent on the amount of transfected anti- 

25 sense expression construct. Transfection with anti-sense Smad3 expression construct showed no 
or very weak inhibitory response (Fig. 11). These results indicate that the expression level of 
Smad7 determines the cellular responsiveness to TGF-pi. 

Example 13: Effect of EGF and PMA on Smad6 and Smad7 mRNA Expression 

30 To examine the possibility of cross-talk between different signaling pathways, the effects 

of forskolin (activator of cAMP), epidermal growth factor (EGF) and phorbol ester PMA 
responses were determined in the absence and presence of 10 ng/ml TGF-pi . Forskolin had no 
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effect alone, and only slightly enhanced the TGF-pl -induced expression of Smad6 and Smad7 
mRNA. EGF (10 ng/ml) by itself was capable of inducing Smad6 and Smad7 expression, and 
acted synergistically with TGF-pi for the induction of Smad7 mRNA (Fig. 12A). EGF was 
added 60 min. before TGF-pl . PMA (10" 8 M) alone had no appreciable effect, but a strong 
5 synergistic response was observed when PMA was added together with TGF-pl on the induction 
of Smad6 and Smad7 expression (Fig. 12B). In this particular experiment the TGF-pi -mediated 
induction of Smad6 and Smad7 mRNA was very low, but could be observed upon longer 
exposure to X-ray film, thereby making synergistic effect more apparent. PAI-1 gene 
expression, which was found to be more rapidly induced by TGF-pi (peak at 90 min) than PMA 
10 (peak at 4 h), was included as a control to show effect of PMA and TGF-pi on cells. 

Example 14: Smad7 is More Potent than Smad8 In Inhibiting TGF-P-induced Responses 

Xenopus Smad8 shares 96% amino acid identity with murine Smad7 within the C-domain 
(Nakayama et al, 1998). Given that this domain is sufficient for receptor binding, and for 

15 inhibition of several TGF-p-induced responses, one would predict that these proteins would 
behave identically with regard to receptor binding and inhibition of downstream signaling. 
Consistent with this possibility, Smad8 interacted with the activated TpR-I, BMPR-IA, BMPR- 
IB and ActR-I. In addition, Smad9 inhibited BMPRI-mediated phosphorylation of Smadl and 
Smad5 in a dose-dependent fashion that was indistinguishable from that of murine Smad7. Fig 

20 13A shows that Smad8 inhibits T pR-I-mediated phosphorylation of Smad2 less efficiently than 
Smad7. Cells were transfected with F-Smad2 alone or together with F-smad7 or F-Smad8 in the 
presence of T PR-I and T pR-II; cells were then incubated in the absence or presence of TGF-p. 
The level of F-Smad2 phosphorylation was determined by [ 32 P]orthophosphate labeling and 
immunoprecipitation of cell lysates with Flag antisera. Expression of F-Smads was analysed by 

25 Western blotting with Flag antiserum on aliquots of cell lysates. Fig. 13B shows that 

transfection of Smad7 in MvlLu cells blocks the TGF-P-induced p3TPLux response more 
effectively than Smad8. Thus, while Smad7 efficiently inhibited TpR-I-mediated 
phosphorylation of Smad2, Smad8 was less effective at doing so (Fig. 13 A). Interestingly, this 
correlated with the more pronounced effect of Smad7 versus Smad8 in inhibiting TGF-P-induced 

30 transcriptional activation of the p3TPLux reporter gene (Fig. 13B). Of note, Smad7 is more 
effective than is Smad8 at inhibiting activin signaling in Xenopus mesoderm induction assay 
(Nakayama et al., 1998). 
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Example 15: Partial Secondary Axis Formation by Smad7 and Smad8 

To determine whether Smad7 can inhibit transduction of BMP signals in vzvo ? patterning 
defects caused by overexpression of Smad7 m Xenopus embryos were analyzed. Fig. 14A shows 
photomicrographs of control tadpole and Fig. 14B shows sibling tadpole made to misexpress 
5 Smad7 in ventral cells. Note induction of partial secondary dorsal axis (arrow). Fig. 14C and 
Fig. 14D (control) depict Smad7-RNA injected sibling showing immunoreactive muscle in 
primary (arrow) and secondary (arrowhead) axes. Fig. 14E shows eye defects and spina bifida in 
an embryo made to overexpress Smad8 in dorsal cells. Fig. 14A shows fusion of eyes in an 
embryo made to overexpress Smad7 in dorsal cells. Overexpression of known BMP antagonists, 

10 such as dominant negative BMP receptors or ligands, on the ventral side of Xenopus embryos 
can induce the formation of a partial secondary axis (reviewed inGraff Cell 89:171-174, 1997). 
Injection of 200 or 400 pg of RNA encoding Smad7 near the ventral midline of four-cell 
embryos led to formation of a secondary dorsal axis in 90% (n=100) or 96% (n=79) of embryos, 
respectively (Fig. 14B). Secondary axis formation was observed in 94% (n=98) of embryos 

15 injected with 200 pg of RNA encoding that isolated Odomain of Smad7. Axes induced by 
either Smad7 or the C-domain of Smad7 contained immunoreactive muscle (Fig. 14D) but 
lacked notochord. In the same series of experiments, injection of 200 or 400 pg of RNA 
encoding Xenopus Smad8 near the ventral midline induced the formation of a secondary axis in 
87 (n=92) or 91% (n=401) of injected embryos, respectively. One notable difference between 

20 the axes induced by the two gene products is that the secondary axes induced by Smad8 included 
a cyclopic or fused eye in 10% (200 pg) or 41% (400 pg) of cases whereas eye formation was 
rarely observed (3% of embryos) following misexpression of Smad7, and then only when a high 
dose (400 pg) of RNA was injected. 

25 Example 16: Ectopic Expression of Smad7 in Dorsal Cells Induces a Subset of the 
Patterning Defects Caused by Overexpression of Smad8 

Ectopic expression of Smad8 within dorsal cells of Xenopus embryos produces a range of 
phenotypic defects which cannot be attributed to blockade of BMP signaling. These include 
failure of the neural folds to fuse (spina bifida) as well as a range of eye defects (Nakayama et 
30 al-, 1998). To directly compare patterning defects cause by misexpression of Smad7 and Smad8 
within dorsal cells, 200 pg of RNA encoding either protein was injected near the dorsal midline 
of four-cell embryos. While 20% of Smad8-injected embryos developed spina bifida, and 40% 
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showed a decrease in eye pigmentation (n=50), 97% of Smad7-injected embryos developed 
completely normally (n=89). When a higher dose (400 pg) of RNA encoding either Smad8 or 
Smad7 was injected, 40% of Smad8 -injected embryos developed with spina bifida (Fig. 14E, 
n=124) while only 7% of embryos injected with Smad7 RNA (n=140), and 3% of embryos 
5 injected with a control RNA (encoding a myc epitope tag, n=70) displayed this defect. In 
contrast, 43% of Smad7-injected embryos developed with variable eye defects ranging from a 
slight to moderate decrease in eye pigment to fusion of the eyes (Fig. 14F). Forty-seven per cent 
of Smad8 -injected siblings developed with comparable eye defects (Nakayama et al., 1998). The 
percentage of Smad8-injected embryos displaying spina bifida, eye induction or eye defects was 

10 somewhat lower in these experiments than that previously reported (Nakayama et al., 1 998), 
possibly due to inherent differences between embryos used for the two sets of experiments. 
Together, these results suggest that Smad7 and Smad8 can interact with common signaling 
pathways in vivo although Smad8 may target additional, as of yet unidentified, pathways. 
Therefore, overexpression of Smad8 in Xenopus embryos produces patterning defects that are not 

15 observed following overexpression of Smad7, suggesting that Smad7 and Smad8 may 

preferentially target distinct signaling pathways. While it cannot be ruled out that the possibility 
that these differences are due to species specificity, e.g., the possibility that mammalian Smad7 is 
more efficiently translated, or interacts more efficiently with receptors in mammalian cells than 
does Xenopus Smad8, the results do not support this interpretation. Specifically, Smad7 is a 

20 more potent antagonist of TGF-p/activin signaling in both mammalian and Xenopus systems 
than is Smad8. 

Example 17: Smad7 but not Smad6 inhibits TGF-pl-induced growth inhibition. 

In order to investigate the effect of inhibitory Smads on TGF-pl -mediated growth 
25 inhibition, mouse Flag-tagged Smad6 (F-Smad6) and F-Smad7 were stably transfected into 
MvlLu cells; Smad6 and Smad7 were placed under transcriptional control of the human 
inducible metallothionien IIA promoter using the pMEP4 expression vector. Both reported 
forms of Smad6 were tested; a mouse long Smad6 version (Smad6L; Imamura et al., Nature 
398:622-626, 1997) as well as a human short Smad6 version (Smad6S; Topper et al., PNAS 
30 94:9314-9319, 1997). Fig. 15 shows characterization of Smad expression level in stable 

transfected MvlLu cell lines in the absence or presence of zinc chloride; wild-type F-Smad7 
(pMEP4-smad7; three independent clones, 7-3, 7-5 S and 7-10), the N-terminal domain of Smad7 
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(pMEP-7N, clone 7N-5), Smad7 with deletion at C-tail (pMEP4-7NCA, clone 7NCA-7), the C- 
terminal domain of Smad7 (pMEP4-7CA, clone 7CA-7) and the "short" variant of Smad6 
(pMEP4-Smad6S, clone S6-2)and the "long" variant of Smad6 (pMEP4-Smad6L, clone L6-3) 
expression results are shown. Cells were metabolically labeled, and cell lysates subjected to 
5 immunoprecipitation with Flag antibody. Immunoprecipitates were analyzed by SDS-PAGE. 
When the expression of Smad6S 5 Smad6L and Smad7 was analyzed, we observed that the 
inhibitory Smads were already expressed in the absence of the inducer zinc chloride. This is 
result of leakage of the metallothionien promoter. Pretreatment of cells with zinc chloride in 
each transfectant led to increase in Smad6 and Smad7 expression (Fig. 15). Subsequently, we 

10 tested the response of the Smad7 and Smad7 (and empty pMEP4) transfectants for TGF-pl - 
induced growth inhibition in the absence or presence of zinc chloride. Multiple independent 
Smad7-expressing clones were analyzed. We found that Smad7 prevented the TGF- pi -induced 
growth-inhibitory effects in MvlLu (Fig. 16A-C). In all Smad7 clones TGF-p 1 -induced growth 
inhibition was blocked to an extent which correlated with their different expression levels of 

15 Smad7. pMEP4-transfected MvlLu cells, in the absence or presence of zinc chloride showed a 
similar dose response curve for TGF-pl as nontransfected MvlLu cells. Ectopic expression 
Smad6L or Smad6S did not affect the TGF-pl -induced growth inhibition (Fig. 16D,E). Fig 16 
shows the effect of TGF-pl on pMEP4 transfected cells in the absence or presence of zinc 
chloride. The relative growth compared to control is plotted against the concentration of TGF- 

20 pi. Different lots of TGF-pl were used in experiments A-C versus D,E, explaining why there is 
a difference between the ED50's on pMEP4 transfected cells in these experiments. All data 
shown are means +/-SED. Thus Smad7 can antagonize TGF-pl -induced growth inhibition, and 
appears to be more effective than Smad6. 

25 Example 18: Smad7 inhibits TGF-pl -induced transcriptional responses. 

Subsequently we examined the effect of ectopic Smad7 expression on TGF-pl -induced 
expression of the endogenous early response genes, Smad7, PAI-I and JunB (Fig. 17). In Fig. 17 
it is shown that ectopically expressed Smad7 inhibits TGF-pi -mediated induction of JUNB (A), 
Smad7 (B) and PAI-1 mRNA (E) expression. Northern blot analysis on RNA from MvlLu cells 
30 stably transfected with pMEP4-Smad7 without or with pretreatment with zinc chloride on 
exposure to TGF-pl. The endogenous mRNAs for junB, Smad7 and PAI-1 are indicated by 
arrows. The asterisk in Smad7 blot indicates the transfected F-Smad7 mRNA. The amount and 
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intactness of total RNA loaded was checked by ethidium bromide staining (C and F). For junB 
(A) and Smad7 probe (B) the same blot was used, which was different from blot for PAI-1 1 
probe (E). After treatment of cells with zinc chloride for 20 h prior to TGF-pi stimulation for 2 
h, the ectopically induced Smad7 was found to inhibit TGF-pl -induced immediate early gene 

5 responses at a Smad7 mRNA extopic expression level similar to that of endogenous gene after 
TGF-p treatment (Fig. 17B). Interestingly, upon treatment with zinc chloride for 6h, no 
significant effect on TGF-P 1 stimulation of early response genes in the Smad7 expressing clones 
was observed, whereas the apparent ectopic Smad7 mRNA levels were higher after 6 h versus 
20h pretreatment. Possibly differences in Smad7 mRNA do not correlate exactly with Smad7 

10 protein levels upon zinc chloride treatment, or a particular post-translationally modification that 
occurs with slow kinetics may be required for interaction of Smad7 with receptor in MvlLu 
cells. 



Example 19: TGF-pi-induced nuclear export of Smad7 in transfected COS cells and 
15 nontransfected MvlLu cells 

The subcellular localization of F-Smad6, F-Smad7 and F-Smad2 in transfected COS cells 
was analyzed in the absence or presence of the constitutive active TpR-I(T204D) by 
immunofluorescence using Flag antibody (Fig. 18A). In Fig. 18, Panel A shows the subcellular 
distribution of F-Smad7, F-Smad7 and F-Smad2 in the absence or presence of constitutive TpR- 

20 I(T204D) mutant in transfected COS cells. Smads were localized in the cells by 

immunofluorescence using Flag antibody. Panel B shows the quantitation of nuclear versus 
cytoplasmic cell staining for F-smads in the absence or presence of T PR-I(T204D). Panel C 
shows the subcellular distribution of Smad7 and Smad6 in nontransfected MvlLu cells in the 
absence and presence of TGF-pi. The Smads were localized in the cells using specific antisera 

25 to Smad6 (ESP) or Smad7 (KER). Smad6 was mainly located in the cytoplasm of the cells and 
the subcellular localization was not different from Smad6 cotransfection with Tp R-I(T204D) ? 
constitutively active TpR-1 . Smad7 was located in the cell nuclei but exported in a large 
proportion of total treated cells from the nuclei to the cytoplasm upon cotransfection with T pR- 
I(T204D), which is in agreement with earlier reports (Eppert et al. 1996; Nakao et al., 1997). 

30 The quantitation of nuclear versus cytoplasmic staining for the different Smads in the absence or 
presence of TpR-I(T204D) are presented in Fig. 18B. The nuclear localization was checked by 
DAPI staining. The observed differences in subcellular distribution of Smad7 versus Smad2 and 
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Smad6, suggest a functional significance of TpR-I-mediated export of Smad7 from the nucleus. 
However, over expression in COS cells and use of epitope tag may affect Smad7 subcellular 
distribution. Therefore, the subcellular distribution of inhibitory Smads in nontransfected 
MvlLu cells using antisera that specifically recognize Smad6 and Smad7 was also examined. In 
5 agreement with results in transfected COS cells, a cytoplasmic staining for Smad6 in the absence 
or presence of TGF-pl (or BMP-7) was observed. In some experiments, a TGF-pl -induced 
nuclear translocation was observed in some of the cells (Fig. 18C). In the absence of ligand, a 
nuclear staining was observed for Smad7. However, upon TGF-pl stimulation a cytoplasmic 
accumulation of Smad7 was induced (Fig. 18C). Two different antisera for each of Smad6 
10 (recognizing Smad6L but not Smad6S) and Smad7, raised against different epitopes, gave 
identical results. 

Example 20: Differential localization of Smad7 deletion mutants in the absence or presence 
ofTGF-pi 

15 The subcellular localization of F-Smad6 and F-Smad7 in MvlLu cells stably expressing 

F-Smad7 and F-Smad7 were analyzed in the absence or presence TGF-pi using the Flag 
antibody and immunofluorescence. In Fig. 19, the subcellular distribution of Smad7 mutants in 
the absence or presence of TGF-pi is shown. The cells were pretreated with zinc chloride. 
Smads were localized in the cells using Flag antibody. Identical results were obtained, compared 

20 to the experiments in which transfected COS cells and nontransfected MvlLu cells were used; 
cytoplasmic staining for F-Smad6L and Smad6S versus a TGF-pi -induced nuclear export for F- 
Smad7 (Fig. 19). To gain more insight into regions in Smad7 that are important for nuclear 
localization and TGF-pl -induced nuclear export, the subcellular distribution of different Smad7 
deletion mutants were analyzed. pMEP4 expression constructs for the C-terminal domain of F- 

25 Smad7 (7C; amino acids 204-426). F-Smad7C with C-tail deletion (7CA; amino acids 204-407), 
Smad7 with deletion of C-tail (7A; amino acids 1-407), and N-terminal domain of Smad7 (7N; 
amino acids 1-203) were stably transfected into MvlLu cells, and cell lines characterized for 
Smad7 expression upon zinc chloride treatment (Fig. 15). All cells showed some degree of leaky 
expression, but in all cell lines zinc treatment included expression of the Smad protein. 

30 Subcellular distribution of the 7C mutant (after zinc chloride treatment) in the absence or 

presence of TGF-pl was similar to that of wild-type Smad7, although its efficiency of nuclear 
export was much less than that of wild-type Smad7. An intact MH2 domain appeared important 
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for nuclear localization as 7CA and 7A mutants were predominantly in cytoplasm in the absence 
of TGF-p 1 . Smad7N was found in the cytoplasm in the absence or presence of TGF-0 1 ; it had a 
spotted localization suggesting association with a particular cell structure (Fig. 19). None of the 
Smad7 deletion constructs were able to interfere with TGF-p 1 -induced growth inhibition as 
5 observed for Smad7 wild-type. 

Example 21: Characterization of the Mouse Smad7 Promoter 

The mouse Smad7 promoter was isolated from a commercially available genomic DNA 
10 library of the 129/Sv mouse strain (Stratagene, La Jolla, CA), using a ~800bp Smad7 cDNA 
fragment as a probe (nucleotides 883-1614 of SEQ ID NO. 3). 

The putative Smad7 promoter that was isolated comprised of a -4,46 5 bp Xlio I-Xho I 
fragment from the 5'-end of the Smad7 mouse genomic DNA. A single Bam HI site was 
identified by restriction mapping ~3,840bp from the 5'-Xho I site. In order to determine the 
15 promoter properties of the ~4,465bp Xho I-Xho I DNA, three different regions of this DNA 
fragment (i.e., ~4,465bp Xho I-Xho I, ~725bp Bam Hl-Xho I, and ~3,840bp Xho I-Bam HI) 
were inserted 5' of the promoter-less luciferase reporter gene in the commercially available 
pGL3 -Basic expression vector (Promega, Madison, WI) and transfected into HepG2 cells. 
Following transfection, the cells were starved overnight in DMEM (0.3% FBS) and later 
20 stimulated with 10 ng/ml of TGF-p 1 or vehicle (negative control) for 16 hours. The cells were 
lysed and subjected to luciferase analysis. These results, depicted in Figure 21, show that both 
the ~4,465bp Xho I-Xho I and the ~725bp Bam Hl-Xho I Smad7 DNA fragments but not the 
~3,840bp Xho I-Bam HI fragment, confer TGF-p 1 inducibility to the heterologous luciferase 
transcription unit. 

25 The 725bp Bam Hl-Xho I Smad7 promoter fragment was further characterized. Its 

sequence is depicted as SEQ ID NO. 15 of the sequence listing. The transcriptional start site was 
localized by an RNAase Pretection Assay and was localized at - nucleotide 499 of SEQ ID NO. 
15. This start site was also found to coincide with the predicted start site as determined by 
computer software programs searching for such sites (Wingender, E. et al., Biotechnology, 1994, 

30 35:273-280; Prestidge, D. et al., J Mol Biol, 1995, 249:923-932). Figure 22 depicts several 
restriction enzyme sites and putative binding sites for different transcription factors and Smad 
proteins in the Bam Hl-Xho I promoter fragment. The putative binding sites for non-Smad 
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transcription factors were identified using the well known in the art TFSEARCH computer 
program and the TRANSFAC MATRIX TABLE database. Putative Smad binding regions were 
assessed using sequence information from the literature. 

All references disclosed herein are incorporated by reference in their entirety. 
5 A Sequence Listing is presented followed by what is claimed: 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION 

(i) APPLICANT: 

(A) NAME: LUDWIG INSTITUTE FOR CANCER RESEARCH 

(B) STREET: 1345 AVENUE OF THE AMERICAS 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 10105 

(ii) TITLE OF THE INVENTION: SMAD7 AND USES THEREOF 
(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Wolf, Greenfield & Sacks, P.C. 

(B) STREET: 600 Atlantic Avenue 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02210-2211 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 20-MAY-1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/047,221 

(B) FILING DATE: 20-MAY-1997 

(A) APPLICATION NUMBER: 60/060,465 

(B) FILING DATE: 30-SEP-1997 

(A) APPLICATION NUMBER: 60/075,940 

(B) FILING DATE: 25-FEB-1998 

(A) APPLICATION NUMBER: 60/077,033 

(B) FILING DATE: 06-MAR-1998 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Van Amsterdam, John R. 

(B) REGISTRATION NUMBER: 4 0,212 

(C) REFERENCE/DOCKET NUMBER: L0461/7032 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-720-3500 

(B) TELEFAX: 617-720-2441 
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(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CAGCAACCAT GGACGGGTTT CACCGTGCAG ATCAGCTTTG TGAAGGGCTG GGGCCAGTGC 6 0 

TACACCCGCC AGTTCATCAG CAGCTGCCCG TGCTGGCTGG AGGTCATCTT CAACAGCCGG 12 0 

TAGTCGGTCG TGTGGTGGGG AGAAGAGGAC AGGGCGGATC GTGAGCCGAG CAGGCCACCG 18 0 

TTCAAACTAC TTGCTGCTAA CCTTTCCCGA GTGATTGCTT TTCATGCAAA CTCTTTGGTT 24 0 

GGTGTTGTTA TTGCCATTCA TTGTTGGTTT TGTTTTGTTC TGTTCTGGGT TCAGCCTGGC 300 

CTGCCCAGCC CTGGTCATCC AGCTGTACTG GCCCCTGGGG GGGTTCTGAG CAGTGCCTCT 3 60 

TGTCTTGGAG AC AGAC CTGG TGCCTGGCGC TGCCTTCAGC AGGCAGCAGG CAGCCTCCTG 42 0 

CACACTGGCT TTTTTAGTCA TTTATGGGCA AAAAGAGTTA AAGTAAAACT TTGCAAATCC 4 80 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ACTCTAGTTC ACAGAGTCGA CTAAGGTGAT GGGGGTTGCA GCACACCAGC TCGGGGTTGA 6 0 

TCTTCCCGTA AGATTCACAG CAACACAGCC TCTTGACTTC CGAGGAATGC CTGAGATCCG 12 0 

GCCACCTGAA CACTTTGCAC AGCAGGAGGG GGAGCGAGTA GGACGAGGGC GGCTGCACTG 18 0 

GCTGCCGCTG GCGGGCACGC CCGGGCCCAG CCTGCAGTCC AGGCGGCCGG GCAGCAGGAG 24 0 

GCACGCGGTG CGCGTACCGC CGCGGGACTC CACGGCCTGA AGCAGCAGCT CCAGC 2 95 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 308 . . . 1585 
(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CGTTCCTCCA GGGTCACGCC GGGGCCCGAA AGCCGCGCAG GGCGCGGGCC GCGCCGGGTG 6 0 

GGGCATCCGA AGCGCAGCCC CCCGATCCCC GGCAGGCGCC CTGGGCCCCC GCGCGCGCCC 12 0 

CGGCCTCTGG GAGATGGCGC ATGC CAGG AG GGCCCCTCCG GCCGCCGCCG GTTCTGCCCG 18 0 

GGCCCCTGCT GTTGCTGCTG TCGCCTGCGC CTGNTGCCCC AAGTCGGCGC CCGATTNTTC 24 0 

ATGGTGTGCG GAGGTCATGT TCGCTCCTTA GCCGGCAAAC GATTTTCTCC TCGCCTCCTC 3 00 

GCCCCGC ATG TTC AGG ACC AAA CGA TCT GCG CTC GTC CGG CGT CTC TGG 34 9 
Met Phe Arg Thr Lys Arg Ser Ala Leu Val Arg Arg Leu Trp 
15 10 

AGG AGC CGT GCG CCC GGC GGC GAG GAC GAG GAG GAG GGC GTG GGG GGT 3 97 

Arg Ser Arg Ala Pro Gly Gly Glu Asp Glu Glu Glu Gly Val Gly Gly 
15 20 25 30 

GGC GGC GGA GGA GGC GAG CTG CGG GGA GAA GGG GCG ACG GAC GGC CGG 44 5 

Gly Gly Gly Gly Gly Glu Leu Arg Gly Glu Gly Ala Thr Asp Gly Arg 
35 40 45 

GCT TAT GGG GCT GGT GGC GGC GGT GCG GGC AGG GCT GGC TGC TGC CTG 4 93 

Ala Tyr Gly Ala Gly Gly Gly Gly Ala Gly Arg Ala Gly Cys Cys Leu 
50 55 60 

GGC AAG GCA GTC CGA GGT GCC AAA GGT CAC CAC CAT CCC CAT CCC CCA 541 
Gly Lys Ala Val Arg Gly Ala Lys Gly His His His Pro His Pro Pro 
65 70 75 

ACC TCG GGT GCC GGG GCG GCC GGG GGC GCC GAG GCG GAT CTG AAG GCG 589 
Thr Ser Gly Ala Gly Ala Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala 
80 85 90 

CTC ACG CAC TCG GTG CTC AAG AAA CTC AAG GAG CGG CAG CTG GAG CTG 63 7 

Leu Thr His Ser Val Leu Lys Lys Leu Lys Glu Arg Gin Leu Glu Leu 
95 100 105 110 

CTG CTT CAG GCC GTG GAG TCC CGC GGC GGT ACG CGC ACC GCG TGC CTC 685 
Leu Leu Gin Ala Val Glu Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu 
115 120 125 

CTG CTG CCC GGC CGC CTG GAC TGC AGG CTG GGC CCG GGG GCG CCC GCC 73 3 

Leu Leu Pro Gly Arg Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala 
130 135 140 

AGC GCG CAG CCC GCG CAG CCG CCC TCG TCC TAC TCG CTC CCC CTC CTG 781 
Ser Ala Gin Pro Ala Gin Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu 
145 150 155 

CTG TGC AAA GTG TTC AGG TGG CCG GAT CTC AGG CAT TCC TCG GAA GTC 82 9 

Leu Cys Lys Val Phe Arg Trp Pro Asp Leu Arg His Ser Ser Glu Val 
160 165 170 

AAG AGG CTG TGT TGC TGT GAA TCT TAC GGG AAG ATC AAC CCC GAG CTG 877 
Lys Arg Leu Cys Cys Cys Glu Ser Tyr Gly Lys lie Asn Pro Glu Leu 
175 180 185 190 



GTG TGC TGC AAC CCC CAT CAC CTT AGT CGA CTC TGT GAA CTA GAG TCT 



925 



WO 98/53068 PCT7US98/10305 

- 66 - . _ 

Val Cys Cys Asn Pro His His Leu Ser Arg Leu Cys Glu Leu Glu Ser 
195 200 205 

CCC CCT CCT CCT TAC TCC AGA TAC CCA ATG GAT TTT CTC AAA CCA ACT 973 
Pro Pro Pro Pro Tyr Ser Arg Tyr Pro Met Asp Phe Leu Lys Pro Thr 
210 215 220 

GCA GGC TGT CCA GAT GCT GTA CCT TCC TCC GCG GAA ACC GGG GGA ACG 1021 
Ala Gly Cys Pro Asp Ala Val Pro Ser Ser Ala Glu Thr Gly Gly Thr 
225 230 235 

AAT TAT CTG GCC CCT GGG GGG CTT TCA GAT TCC CAA CTT CTT CTG GAG 106 9 

Asn Tyr Leu Ala Pro Gly Gly Leu Ser Asp Ser Gin Leu Leu Leu Glu 
240 245 250 

CCT GGG GAT CGG TCA CAC TGG TGC GTG GTG GCA TAC TGG GAG GAG AAG 1117 
Pro Gly Asp Arg Ser His Trp Cys Val Val Ala Tyr Trp Glu Glu Lys 
255 260 265 270 

ACT CGC GTG GGG AGG CTC TAC TGT GTC CAA GAG CCC TCC CTG GAT ATC 1165 
Thr Arg Val Gly Arg Leu Tyr Cys Val Gin Glu Pro Ser Leu Asp lie 
275 280 285 

TTC TAT GAT CTA CCT CAG GGG AAT GGC TTT TGC CTC GGA CAG CTC AAT 1213 
Phe Tyr Asp Leu Pro Gin Gly Asn Gly Phe Cys Leu Gly Gin Leu Asn 
290 295 300 

TCG GAC AAC AAG AGT CAG CTG GTA CAG AAA GTG CGG AGC AAG ATC GGC 12 61 

Ser Asp Asn Lys Ser Gin Leu Val Gin Lys Val Arg Ser Lys lie Gly 
305 310 315 

TGT GGC ATC CAG CTG ACG CGG GAA GTG GAT GGC GTG TGG GTT TAC AAC 13 09 

Cys Gly lie Gin Leu Thr Arg Glu Val Asp Gly Val Trp Val Tyr Asn 
320 325 330 

CGC AGC AGT TAC CCC ATC TTC ATC AAG TCC GCC ACA CTG GAC AAC CCG 13 57 

Arg Ser Ser Tyr Pro lie Phe lie Lys Ser Ala Thr Leu Asp Asn Pro 
335 340 345 350 

GAC TCC AGG ACG CTG TTG GTG CAC AAA GTG TTC CCT GGT TTC TCC ATC 14 05 

Asp Ser Arg Thr Leu Leu Val His Lys Val Phe Pro Gly Phe Ser lie 
355 360 365 

AAG GCT TTT GAC TAT GAG AAA GCC TAC AGC CTG CAG CGG CCC AAT GAC 14 53 

Lys Ala Phe Asp Tyr Glu Lys Ala Tyr Ser Leu Gin Arg Pro Asn Asp 
370 375 380 

CAC GAG TTC ATG CAG CAA CCA TGG ACG GGT TTC ACC GTG CAG ATC AGC 1501 
His Glu Phe Met Gin Gin Pro Trp Thr Gly Phe Thr Val Gin lie Ser 
385 390 395 

TTT GTG AAG GGC TGG GGC CAG TGC TAC ACC CGC CAG TTC ATC AGC AGC 154 9 

Phe Val Lys Gly Trp Gly Gin Cys Tyr Thr Arg Gin Phe lie Ser Ser 
400 405 410 



TGC CCG TGC TGG CTG GAG GTC ATC TTC AAC AGC CGG TAGTCGGTCG TGTGGT 16 01 
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Cys Pro Cys Trp Leu Glu Val lie Phe Asn Ser Arg 
415 420 425 

GGGGAGAAGA GGACAGGGCG GATCGTGAGC CGAGCAGGCC ACCGTTCAAA CTACTTGCTG 1661 

CTAACCTTTC CCGAGTGATT GCTTTTCATG CAAACTCTTT GGTTGGTGTT GTTATTGCCA 1721 

TTCATTGTTG GTTTTGTTTT GTTCTGTTCT GGGTTCAGCC TGGCCTGCCC AGCCCTGGTC 1781 

ATCCAGCTGT ACTGGCCCCT GGGGGGGTTC TGAGCAGTGC CTCTTGTCTT GGAGACAGAC 1841 

CTGGTGCCTG GCGCTGCCTT CAGCAGGCAG CAGGCAGCCT CCTGCACACT GGCTTTTTTA 1901 
GTCATTTATG GGCAAAAAGA GTTAAAGTAA AACTTTGCAA ATCC 194 5 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met 


Phe 


Arg 


Thr 


Lvs 


Arg 


Ser 


Ala 


Leu 


Val 


Arg 


Arg 


Leu 


Trp 


Arg 


Ser 


1 








5 










10 










15 




Arg 


Ala 


Pro 


Glv 


Glv 


Glu 


Asp 


Glu 


Glu 


Glu 


Gly 


Val 


Gly Gly 


Gly Gly 








20 










25 










30 






Gly 


Gly 


Gly 


Glu 


Leu 


Arg 


Gly 


Glu 


Gly 


Ala 


Thr 


Asp 


Gly Arg Ala 


Tyr 






35 










40 










45 








Gly 


Ala 


Gly 


Gly 


Gly 


Gly 


Ala 


Gly 


Arg 


Ala 


Gly 


Cys 


Cys 


Leu 


Gly Lys 




50 










55 










60 










Ala 


Val 


Arg 


Gly 


Ala 


Lys 


Gly 


His 


His 


His 


Pro 


His 


Pro 


Pro 


Thr 


Ser 


65 










70 










75 










80 


Gly 


Ala 


Gly 


Ala 


Ala 


Gly 


Gly 


Ala 


Glu 


Ala 


Asp 


Leu 


Lys 


Ala 


Leu 


Thr 










85 










90 










95 




His 


Ser 


Val 


Leu 


Lys 


Lys 


Leu 


Lys 


Glu 


Arg 


Gin 


Leu 


Glu 


Leu 


Leu 


Leu 








100 










105 










110 






Gin 


Ala 


Val 


Glu 


Ser 


Arg 


Gly 


Gly 


Thr 


Arg 


Thr 


Ala 


Cys 


Leu 


Leu 


Leu 






115 










120 










125 








Pro 


Gly 


Arg 


Leu 


Asp 


Cys 


Arg 


Leu 


Gly 


Pro 


Gly 


Ala 


Pro 


Ala 


Ser 


Ala 




130 










135 










140 










Gin 


Pro 


Ala 


Gin 


Pro 


Pro 


Ser 


Ser 


Tyr 


Ser 


Leu 


Pro 


Leu 


Leu 


Leu 


Cys 


145 










150 










155 










160 


Lys 


Val 


Phe 


Arg 


Trp 


Pro 


Asp 


Leu 


Arg 


His 


Ser 


Ser 


Glu 


Val 


Lys 


Arg 










165 










170 










175 




Leu 


Cys 


Cys 


Cys 


Glu 


Ser 


Tyr 


Gly 


Lys 


He 


Asn 


Pro 


Glu 


Leu 


Val 


Cys 








180 










185 










190 






Cys 


Asn 


Pro 


His 


His 


Leu 


Ser 


Arg 


Leu 


Cys 


Glu 


Leu 


Glu 


Ser 


Pro 


Pro 






195 










200 










205 








Pro 


Pro 


Tyr 


Ser 


Arg 


Tyr 


Pro 


Met 


Asp 


Phe 


Leu 


Lys 


Pro 


Thr 


Ala 


Gly 




210 










215 










220 










Cys 


Pro 


Asp 


Ala 


Val 


Pro 


Ser 


Ser 


Ala 


Glu 


Thr 


Gly 


Gly Thr Asn 


Tyr 


225 










230 










235 










240 


Leu 


Ala 


Pro 


Gly 


Gly 


Leu 


Ser 


Asp 


Ser 


Gin 


Leu 


Leu 


Leu 


Glu 


Pro 


Gly 










245 










250 










255 




Asp 


Arg 


Ser 


His 


Trp 


Cys 


Val 


Val 


Ala 


Tyr 


Trp 


Glu 


Glu 


Lys 


Thr 


Arg 
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260 










265 










270 






Val 


Gly 


Arg 
275 


Leu 


Tyr 


Cys 


Val 


Gin 
280 


Glu 


Pro 


Ser 


Leu 


Asp 
285 


He 


Phe 


Tyr 


Asp 


Leu 


Pro 


Gin 


Gly 


Asn 


Gly 


Phe 


Cys 


Leu 


Gly Gin Leu 


Asn 


Ser 


Asp 




290 










295 










3 00 










Asn 


Lys 


Ser 


Gin 


Leu 


Val 


Gin 


Lys 


Val 


Arg 


Ser 


Lys 


He 


Gly 


Cys 


Gly 


305 










310 










315 










320 


He 


Gin 


Leu 


Thr 


Arg 
325 


Glu 


Val 


Asp 


Gly 


Val 
330 


Trp 


Val 


Tyr 


Asn 


Arg 
335 


Ser 


Ser 


Tyr 


Pro 


He 
340 


Phe 


He 


Lys 


Ser 


Ala 
345 


Thr 


Leu 


Asp 


Asn 


Pro 
350 


Asp 


Ser 


Arg 


Thr 


Leu 
355 


Leu 


Val 


His 


Lys 


Val 
360 


Phe 


Pro 


Gly 


Phe 


Ser 
365 


He 


Lys 


Ala 


Phe 


Asp 
370 


Tyr 


Glu 


Lys 


Ala 


Tyr 
375 


Ser 


Leu 


Gin 


Arg 


Pro 
380 


Asn 


Asp 


His 


Glu 


Phe 


Met 


Gin 


Gin 


Pro 


Trp 


Thr 


Gly 


Phe 


Thr 


Val 


Gin 


He 


Ser 


Phe 


Val 


385 










390 










395 










400 


Lys 


Gly 


Trp 


Gly 


Gin 
405 


Cys 


Tyr 


Thr 


Arg 


Gin 
410 


Phe 


He 


Ser 


Ser 


Cys 
415 


Pro 


Cys 


Trp 


Leu 


Glu 
420 


Val 


He 


Phe 


Asn 


Ser 
425 


Arg 















(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 50 . . . 1327 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GAATTCGGCA CGAGGGCAAA CGACTTTTCT CCTCGCCTCC TCGCCCCGC ATG TTC AGG 5 8 

Met Phe Arg 
1 

ACC AAA CGA TCT GCG CTC GTC CGG CGT CTC TGG AGG AGC CGT GCG CCC 106 
Thr Lys Arg Ser Ala Leu Val Arg Arg Leu Trp Arg Ser Arg Ala Pro 
5 10 15 

GGC GGC GAG GAC GAG GAG GAG GGC GCA GGG GGA GGT GGA GGA GGA GGC 154 
Gly Gly Glu Asp Glu Glu Glu Gly Ala Gly Gly Gly Gly Gly Gly Gly 
20 25 30 35 

GAG CTG CGG GGA GAA GGG GCG ACG GAC AGC CGA GCG CAT GGG GCC GGT 2 02 

Glu Leu Arg Gly Glu Gly Ala Thr Asp Ser Arg Ala His Gly Ala Gly 
40 45 50 



GGC GGC GGC CCG GGC AGG GCT GGA TGC TGC CTG GGC AAG GCG GTG CGA 



250 
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Gly Gly Gly Pro Gly Arg Ala Gly Cys Cys Leu Gly Lys Ala Val Arg 
55 60 65 

GGT GCC AAA TGT CAC CAC CAT CCC CAC CCG CCA GCC GCG GGC GCC GGC 298 
Gly Ala Lys Cys His His His Pro His Pro Pro Ala Ala Gly Ala Gly 
70 75 80 

GCG GCC GGG GGC GCC GAG GCG GAT CTG AAG GCG CTC ACG CAC TCG GTG 346 
Ala Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val 
85 90 95 

CTC AAG AAA CTG AAG GAG CGG CAG CTG GAG CTG CTG CTC CAG GCC GTG 3 94 

Leu Lys Lys Leu Lys Glu Arg Gin Leu Glu Leu Leu Leu Gin Ala Val 
100 105 110 115 

GAG TCC CGC GGC GGG ACG CGC ACC GCG TGC CTC CTG CTG CCC GGC CGC 442 
Glu Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu Pro Gly Arg 
120 125 130 

CTG GAC TGC AGG CTG GGC CCG GGG GCG CCC GCC GGC GCG CAG CCT GCG 490 
Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala Gin Pro Ala 
135 140 145 

CAG CCG CCC TCG TCC TAC TCG CTC CCC CTC CTG CTG TGC AAA GTG TTC 53 8 

Gin Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu Leu Cys Lys Val Phe 
150 155 160 

AGG TGG CCG GAT CTC AGG CAT TCC TCG GAA GTC AAG AGG CTG TGT TGC 586 
Arg Trp Pro Asp Leu Arg His Ser Ser Glu Val Lys Arg Leu Cys Cys 
165 170 175 

TGT GAA TCT TAC GGG AAG ATC AAC CCC GAG CTG GTG TGC TGC AAC CCC 634 
Cys Glu Ser Tyr Gly Lys lie Asn Pro Glu Leu Val Cys Cys Asn Pro 
180 185 190 195 

CAT CAC CTT AGC CGA CTC TGC GAA CTA GAG TCT CCC CCC CCT CCT TAC 6 82 

His His Leu Ser Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr 
200 205 210 

TCC AGA TAC CCG ATG GAT TTT CTC AAA CCA ACT GCA GAC TGT CCA GAT 73 0 

Ser Arg Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro Asp 
215 220 225 

GCT GTG CCT TCC TCC GCT GAA ACA GGG GGA ACG AAT TAT CTG GCC CCT 778 
Ala Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr Leu Ala Pro 
230 235 240 

GGG GGG CTT TCA GAT TCC CAA CTT CTT CTG GAG CCT GGG GAT CGG TCA 82 6 

Gly Gly Leu Ser Asp Ser Gin Leu Leu Leu Glu Pro Gly Asp Arg Ser 
245 250 255 

CAC TGG TGC GTG GTG GCA TAC TGG GAG GAG AAG ACG AGA GTG GGG AGG 874 
His Trp Cys Val Val Ala Tyr Trp Glu Glu Lys Thr Arg Val Gly Arg 
260 265 270 275 



CTC TAC TGT GTC CAG GAG CCC TCT CTG GAT ATC TTC TAT GAT CTA CCT 



922 
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Leu Tyr Cys Val Gin Glu Pro Ser Leu Asp lie Phe Tyr Asp Leu Pro 
280 285 290 



CAG GGG AAT GGC TTT TGC CTC GGA CAG CTC AAT TCG GAC AAC AAG AGT 970 
Gin Gly Asn Gly Phe Cys Leu Gly Gin Leu Asn Ser Asp Asn Lys Ser 
295 300 305 



CAG CTG GTG CAG AAG GTG CGG AGC AAA ATC GGC TGC GGC ATC CAG CTG 1018 
Gin Leu Val Gin Lys Val Arg Ser Lys lie Gly Cys Gly lie Gin Leu 
310 315 320 



ACG CGG GAG GTG GAT GGT GTG TGG GTG TAC AAC CGC AGC AGT TAC CCC 10 6 6 

Thr Arg Glu Val Asp Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro 
325 330 335 



ATC TTC ATC AAG TCC GCC ACA CTG GAC AAC CCG GAC TCC AGG ACG CTG 1114 

lie Phe lie Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser Arg Thr Leu 
340 345 350 355 

TTG GTA CAC AAG GTG TTC CCC GGT TTC TCC ATC AAG GCT TTC GAC TAC 1162 

Leu Val His Lys Val Phe Pro Gly Phe Ser lie Lys Ala Phe Asp Tyr 

360 365 370 



GAG AAG GCG TAC AGC CTG CAG CGG CCC AAT GAC CAC GAG TTT ATG CAG 1210 
Glu Lys Ala Tyr Ser Leu Gin Arg Pro Asn Asp His Glu Phe Met Gin 
375 380 385 



CAG CCG TGG ACG GGC TTT ACC GTG CAG ATC AGC TTT GTG AAG GGC TGG 12 5 8 

Gin Pro Trp Thr Gly Phe Thr Val Gin lie Ser Phe Val Lys Gly Trp 
390 395 400 



GGC CAG TGC TAC ACC CGC CAG TTC ATC AGC AGC TGC CCG TGC TGG CTA 13 0 6 

Gly Gin Cys Tyr Thr Arg Gin Phe lie Ser Ser Cys Pro Cys Trp Leu 
405 410 415 



GAG GTC ATC TTC AAC AGC CGG TAGCCGCGTG CGGAGGGGAC AGAGCGTGAG CTGA 13 61 
Glu Val lie Phe Asn Ser Arg 
420 425 



GCAGGCCACA CTTCAAACTA CTTTGCTGCT AATATTTTCC TCCTGAGTGC TTGCTTTTCA 1421 

TGCAAACTCT TTGGTCGTTT TTTTTTTGTT TGTTGGTTGG TTTTCTTCTT CTCGTCCTCG 14 81 

TTTGTGTTCT ATTTTTCTAA CTACAAAGGT TTAAATGAAC AAGAGAAGCA TCTCTCATTG 1541 

GAAATTTAGC ATTGTAGTGC TTTGAGAGAG AAAGGACTCC CTGNAAAAAA ACCTGAGATT 1601 

TATTAAAGNA AAAAATGTAT TTTATGTTAT ATATAAATAT ATTATTACTT GTAAATATAA 1661 

AGACGTTTTA TAAGCATCAT TATTTATGTA TTGTGCAATG TGTATAAACN AGNAAAATAA 1721 

AGAAAAGATG CACTTTGCTT TAATATAAAT GCAAATAACA AATGCCAAAT TAAAAAAGAT 1781 

AAACACAAGA TTGGTGTTTT TTTCTATGGG TGTTATCACC TAGCNGAATG TTTTTCTAAA 1841 

GGAGTTTATG TTCCATTAAA CGATTTTTAA AANGT 18 76 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



Met 


Phe 


Arg 


Thr 


Lys 


Arg 


Ser 


Ala 


Leu 


Val 


Arg 


Arg 


Leu 


Trp 


Arg 


Ser 


1 








5 










10 










15 




Arg 


Ala 


Pro 


Gly 


Gly 


Glu 


Asp 


Glu 


Glu 


Glu 


Gly 


Ala 


Gly 


Gly 


Gly 


Gly 








20 










25 










30 






Gly Gly 


Gly 


Glu 


Leu 


Arg 


Gly 


Glu 


Gly 


Ala 


Thr 


Asp 


Ser 


Arg 


Ala 


His 






35 










40 










45 








Gly Ala 


Gly 


Gly 


Gly 


Gly 


Pro 


Gly 


Arg 


Ala 


Gly 


Cys 


Cys 


Leu 


Gly 


Lys 




50 










55 










60 










Ala 


Val 


Arg 


Gly 


Ala 


Lys 


Cys 


His 


His 


His 


Pro 


His 


Pro 


Pro 


Ala 


Ala 


65 










70 










75 










80 


Gly Ala 


Gly 


Ala 


Ala 


Gly 


G!y 


Ala 


Glu 


Ala 


Asp 


Leu 


Lys 


Ala 


Leu 


Thr 










85 










90 










95 




His 


Ser 


Val 


Leu 


Lys 


Lys 


Leu 


Lys 


Glu 


Arg 


Gin 


Leu 


Glu 


Leu 


Leu 


Leu 








100 










105 










110 






Gin 


Ala 


Val 


Glu 


Ser 


Arg 


Gly 


Gly 


Thr 


Arg 


Thr 


Ala 


Cys 


Leu 


Leu 


Leu 






115 










12 0 










125 








Pro 


Gly 


Arg 


Leu 


Asp 


Cys 


Arg 


Leu 


Gly 


Pro 


Gly 


Ala 


Pro 


Ala 


Gly 


Ala 




130 










135 










140 










Gin 


Pro 


Ala 


Gin 


Pro 


Pro 


Ser 


Ser 


Tyr 


Ser 


Leu 


Pro 


Leu 


Leu 


Leu 


Cys 


145 










150 










155 










160 


Lys 


Val 


Phe 


Arg 


Trp 


Pro 


Asp 


Leu 


Arg 


His 


Ser 


Ser 


Glu 


Val 


Lys 


Arg 










165 










170 










175 




Leu 


Cys 


Cys 


Cys 


Glu 


Ser 


Tyr 


Gly 


Lys 


He 


Asn 


Pro 


Glu 


Leu 


Val 


Cys 








180 










185 










190 






Cys 


Asn 


Pro 


His 


His 


Leu 


Ser 


Arg 


Leu 


Cys 


Glu 


Leu 


Glu 


Ser 


Pro 


Pro 






195 










200 










205 








Pro 


Pro 


Tyr 


Ser 


Arg 


Tyr 


Pro 


Met 


Asp 


Phe 


Leu 


Lys 


Pro 


Thr 


Ala 


Asp 




210 










215 










220 










Cys 


Pro 


Asp 


Ala 


Val 


Pro 


Ser 


Ser 


Ala 


Glu 


Thr 


Gly 


Gly 


Thr 


Asn 


Tyr 


225 










230 










235 










240 


Leu 


Ala 


Pro 


Gly 


Gly 


Leu 


Ser 


Asp 


Ser 


Gin 


Leu 


Leu 


Leu 


Glu 


Pro 


Gly 










245 










250 










255 




Asp 


Arg 


Ser 


His 


Trp 


Cys 


Val 


Val 


Ala 


Tyr 


Trp 


Glu 


Glu 


Lys 


Thr 


Arg 








260 










265 










270 






Val 


Gly 


Arg 


Leu 


Tyr 


Cys 


Val 


Gin 


Glu 


Pro 


Ser 


Leu 


Asp 


He 


Phe 


Tyr 






275 










2 80 










285 








Asp 


Leu 


Pro 


Gin 


Gly 


Asn 


Gly 


Phe 


Cys 


Leu 


Gly 


Gin 


Leu 


Asn 


Ser 


Asp 




290 










295 










300 










Asn 


Lys 


Ser 


Gin 


Leu 


Val 


Gin 


Lys 


Val 


Arg 


Ser 


Lys 


He 


Gly 


Cys 


Gly 


305 










310 










315 










320 


He 


Gin 


Leu 


Thr 


Arg 


Glu 


Val 


Asp 


Gly 


Val 


Trp 


Val 


Tyr 


Asn 


Arg 


Ser 










325 










330 










335 




Ser 


Tyr 


Pro 


He 


Phe 


He 


Lys 


Ser 


Ala 


Thr 


Leu 


Asp Asn 


Pro 


Asp 


Ser 








340 










345 










350 






Arg 


Thr 


Leu 


Leu 


Val 


His 


Lys 


Val 


Phe 


Pro 


Gly 


Phe 


Ser 


He 


Lys 


Ala 






355 










360 










365 








Phe 


Asp 


Tyr 


Glu 


Lys 


Ala 


Tyr 


Ser 


Leu 


Gin 


Arg 


Pro 


Asn Asp 


His 


Glu 




370 










375 










380 










Phe 


Met 


Gin 


Gin 


Pro 


Trp 


Thr 


Gly 


Phe 


Thr 


Val 


Gin 


He 


Ser 


Phe 


Val 


385 










390 










395 










400 


Lys 


Gly 


Trp 


Gly 


Gin 


Cys 


Tyr 


Thr 


Arg 


Gin 


Phe 


He 


Ser 


Ser 


Cys 


Pro 
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405 410 415 

Cys Trp Leu Glu Val lie Phe Asn Ser Arg 
420 425 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATGTTCAGGA CCAAACGATC TGCGCTCGTC CGGCGTCTCT GGAGGAGCCG TGCGCCCGGC 60 

GGCGAGGACG AGGAGGAGGG CGTGGGGGGT GGCGGCGGAG GAGGCGAGCT GCGGGGAGAA 12 0 

GGGGCGACGG ACGGCCGGGC TTATGGGGCT GGTGGCGGCG GTGCGGGCAG GGCTGGCTGC 180 

TGCCTGGGCA AGGCAGTCCG AGGTGC C AAA GGTCACCACC ATCCCCATCC CCCAACCTCG 24 0 

GGTGCCGGGG CGGCCGGGGG CGCCGAGGCG GATCTGAAGG CGCTCACGCA CTCGGTGCTC 3 00 

AAGAAAC TC A AGGAGCGGCA GCTGGAGCTG CTGCTTCAGG CCGTGGAGTC CCGCGGCGGT 3 60 

ACGCGCACCG CGTGCCTCCT GCTGCCCGGC CGCCTGGACT GCAGGCTGGG CCCGGGGGCG 42 0 

CCCGCCAGCG CGCAGCCCGC GCAGCCGCCC TCGTCCTACT CGCTCCCCCT CCTGCTGTGC 480 

AAAGTGTTCA GGTGGCCGGA TCTCAGGCAT TCCTCGGAAG TCAAGAGGCT GTGTTGCTGT 54 0 

GAATCTTACG GGAAGATCAA CCCCGAGCTG GTGTGCTGCA ACCCCCATCA CCTTAGTCGA 60 0 

CTCTGTGAAC TAGAGTCTCC CCCTCCTCCT TACTCCAGAT AC C C AATGGA TTTTCTCAAA 660 

CCAACTGCAG GCTGTCCAGA TGCTGTACCT TCCTCCGCGG AAACCGGGGG AACGAATTAT 72 0 

CTGGCCCCTG GGGGGCTTTC AGATTCCCAA CTTCTTCTGG AGCCTGGGGA TCGGTCACAC 78 0 

TGGTGCGTGG TGGCATACTG GGAGGAGAAG ACTCGCGTGG GGAGGCTCTA CTGTGTCCAA 84 0 

GAGCCCTCCC TGGATATCTT CTATGATCTA CCTCAGGGGA ATGGCTTTTG CCTCGGACAG 900 

CTCAATTCGG ACAACAAGAG TCAGCTGGTA CAGAAAGTGC GGAGCAAGAT CGGCTGTGGC 960 

ATCCAGCTGA CGCGGGAAGT GGATGGCGTG TGGGTTTACA ACCGCAGCAG TTACCCCATC 102 0 

TTCATCAAGT CCGCCACACT GGACAACCCG GACTCCAGGA CGCTGTTGGT GCACAAAGTG 1080 

TTCCCTGGTT TCTCCATCAA GGCTTTTGAC TATGAGAAAG CCTACAGCCT GCAGCGGCCC 114 0 

AATGACCACG AGTTCATGCA GCAACCATGG ACGGGTTTCA CCGTGCAGAT CAGCTTTGTG 12 00 

AAGGGCTGGG GCCAGTGCTA CACCCGCCAG TTCATCAGCA GCTGCCCGTG CTGGCTGGAG 12 6 0 

GTCATCTTCA ACAGCCGGTA G 12 81 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

ATGTTCAGGA CCAAACGATC TGCGCTCGTC CGGCGTCTCT GGAGGAGCCG TGCGCCCGGC 6 0 

GGCGAGGACG AGGAGGAGGG CGCAGGGGGA GGTGGAGGAG GAGGCGAGCT GCGGGGAGAA 12 0 

GGGGCGACGG ACAGCCGAGC GCATGGGGCC GGTGGCGGCG GCCCGGGCAG GGCTGGATGC 18 0 

TGCCTGGGCA AGGCGGTGCG AGGTG C C AAA TGTCACCACC ATCCCCACCC GCCAGCCGCG 24 0 

GGCGCCGGCG CGGCCGGGGG CGCCGAGGCG GATCTGAAGG CGCTCACGCA CTCGGTGCTC 3 00 
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AAGAAAC TG A AGGAGCGGCA GCTGGAGCTG CTGCTCCAGG CCGTGGAGTC CCGCGGCGGG 3 60 

ACGCGCACCG CGTGCCTCCT GCTGCCCGGC CGCCTGGACT GCAGGCTGGG CCCGGGGGCG 42 0 

CCCGCCGGCG CGCAGCCTGC GCAGCCGCCC TCGTCCTACT CGCTCCCCCT CCTGCTGTGC 4 80 

AAAGTGTTCA GGTGGCCGGA TCTCAGGCAT TCCTCGGAAG TCAAGAGGCT GTGTTGCTGT 54 0 

GAATCTTACG GGAAGATCAA CCCCGAGCTG GTGTGCTGCA ACCCCCATCA CCTTAGCCGA 6 00 

CTCTGCGAAC TAGAGTCTCC CCCCCCTCCT TACTCCAGAT ACCCGATGGA TTTTCTCAAA 66 0 

CCAACTGCAG ACTGTCCAGA TGCTGTGCCT TCCTCCGCTG AAACAGGGGG AACGAATTAT 72 0 

CTGGCCCCTG GGGGGCTTTC AGATTCCCAA CTTCTTCTGG AGCCTGGGGA TCGGTCACAC 78 0 

TGGTGCGTGG TGGCATACTG GGAGGAGAAG ACGAGAGTGG GGAGGCTCTA CTGTGTCCAG 84 0 

GAGCCCTCTC TGGATATCTT CTATGAT CT A CCTCAGGGGA ATGGCTTTTG CCTCGGACAG 900 

CTCAATTCGG ACAACAAGAG TCAGCTGGTG CAGAAGGTGC GGAGCAAAAT CGGCTGCGGC 96 0 

ATCCAGCTGA CGCGGGAGGT GGATGGTGTG TGGGTGTACA ACCGCAGCAG TTACCCCATC 102 0 

TTCATCAAGT CCGCCACACT GGACAACCCG GACTCCAGGA CGCTGTTGGT ACACAAGGTG 108 0 

TTCCCCGGTT TCTCCATCAA GGCTTTCGAC TACGAGAAGG CGTACAGCCT GCAGCGGCCC 114 0 

AATGACCACG AGTTTATGCA GCAGCCGTGG ACGGGCTTTA CCGTGCAGAT CAGCTTTGTG 12 00 

AAGGGCTGGG GCCAGTGCTA CACCCGCCAG TTCATCAGCA GCTGCCCGTG CTGGCTAGAG 12 60 

GTCATCTTCA ACAGCCGGTA G 12 81 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATGTTCAGGT CCAAACGCTC GGGGCTGGTG CGGCGACTTT GGCGAAGTCG TGTGGTCCCC 6 0 

GACCGGGAGG AAGGCGGCAG CGGCGGCGGC GGTGGCGGCG ACGAGGATGG GAGCTTGGGC 12 0 

AGCCGAGCTG AGCCGGCCCC GCGGGCAAGA GAGGGCGGAG GCTGCGGCCG CTCCGAAGTC 180 

CGCCCGGTAG CCCCGCGGCG GCCCCGGGAC GCAGTGGGAC AGCGAGGCGC CCAGGGCGCG 24 0 

GGGAGGCGCC GGCGCGCAGG GGGCCCCCCG AGGCCCATGT CGGAGCCAGG GGCCGGCGCT 3 00 

GGGAGCTCCC TGCTGGACGT GGCGGAGCCG GGAGGCCCGG GCTGGCTGCC CGAGAGTGAC 3 60 

TGCGAGACGG TGACCTGCTG TCTCTTTTCG GAGCGGGACG CCGCCGGCGC GCCCCGGGAC 42 0 

GCCAGCGACC CCCTGGCCGG GGCGGCCCTG GAGCCGGCGG GCGGCGGGCG GAGTCGCGAA 4 80 

GCGCGCTCGC GGCTGCTGCT GCTGGAGCAG GAACTCAAAA CCGTCACGTA CTCGCTGCTG 54 0 

AAGCGGCTCA AGGAGCGCTC GCTGGACACG CTGCTGGAGG CGGTGGAGTC CCGCGGCGGC 600 

GTGCCGGGCG GCTGCGTGCT GGTGCCGCGC GCCGACCTCC GCCTGGGCGG CCAGCCCGCG 660 

CCGCCGCAGC TGCTGCTCGG CCGCCTCTTT CGCTGGCCCG ACCTGCAGCA CGCCGTGGAG 72 0 

CTGAAGCCCC TGTGCGGCTG CCACAGCTTC GCCGCCGCCG CCGACGGCCC TACCGTGTGC 780 

TGCAACCCCT ACCACTTCAG CCGGCTCTGC GGGCCCGAAT CTCCGCCACC TCCCTACTCT 84 0 

CGGCTGTCTC CTCGCGACGA GTACAAGCCA CTGGATCTGT CCGATTCCAC ATTGTCTTAC 90 0 

ACTGAAACGG AGGCTACCAA CTCCCTCATC ACTGCTCCGG GTGAATTCTC AGACGCCAGC 96 0 

ATGTCTCCGG ACGCCACCAA GCCGAGCCAC TGGTGCAGCG TGGCGTACTG GGAGCACCGG 102 0 

ACGCGCGTGG GCCGCCTCTA TGCGGTGTAC GACCAGGCCG TCAGCATCTT CTACGACCTA 108 0 

CCTCAGGGCA GCGGCTTCTG CCTGGGCCAG CTCAACCTGG AGCAGCGCAG CGAGTCGGTG 114 0 

CGGCGAACGC GCAGCAAGAT CGGCTTCGGC ATCCTGCTCA GCAAGGAGCC CGACGGCGTG 12 00 

TGGGC CTAC A ACCGCGGCGA GCACCCCATC TTCGTCAACT CCCCGACGCT GGACGCGCCC 12 60 

GGCGGCCGCG CCCTGGTCGT GCGCAAGGTG CCCCCCGGCT ACTCCATCAA GGTGTTCGAC 132 0 

TTCGAGCGCT CGGGCCTGCA GCACGCGCCC GAGCCCGACG CCGCCGACGG CCCCTACGAC 13 8 0 

CCCAACAGCG TCCGCATCAG CTTCGCCAAG GGCTGGGGGC CCTGCTACTC CCGGCAGTTC 144 0 

ATCACCTCCT GCCCCTGCTG GCTGGAGATC CTCCTCAACA ACCCCAGATA G 14 91 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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Ala Val 
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He 
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485 490 495 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Glu Ser Pro Pro Pro Pro Tyr Ser Arg Leu Ser Pro Arg Asp Glu Tyr 

15 10 15 

Lys Pro Leu Asp 
20 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser Arg Gin Phe He Thr Ser Cys Pro Cys Trp Leu Glu He Leu Asn 

15 10 15 

Pro Arg 



(2) INFORMATION FOR SEQ ID NO : 13 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Lys Glu Arg Gin Leu Glu Leu Leu Leu Gin Ala Val Glu Ser Arg Gly 

15 10 15 

Gly Thr Arg Thr Ala 
20 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Lys Ala Val Arg Gly Ala Lys Gly His His His Pro His Pro Pro 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

CTCGAGATCC GCCGCCAGGG CCTTAAAACA AAACGAATGA ATGAAGCGCC CGGGCGAGCT 6 0 

CTGCTGGTCT AAATCGGGCC ACTAACCGGC TCATTCCGCT CCAAACTCGC CCGCTCCATG 12 0 

CCCCCAGCTC TTCCGATTTC CCCCTGCTCC CGCTGGGTCT CCGCCAGACC CCCGGGGCCG 18 0 

TCTCGGCCTC CCACCTCCCT CCACCGTGGG TACCCCTCTC TAGACCTGGG AGAGGGTGGC 24 0 

AGTAACTGGG AGGGGGGTTG AAATAGCTTT TAGAAACCCG ATCTGTTGTT TGCGAAACAC 3 00 

AATCGCTTTT TTTTTTTTTA AAGCGACAGG GTGTCTAGAC GGCCACGTGA CGAGGCCGGA 3 60 

GCCGGGCGCG CCACTGCGCA GTGGAACCAG CCGAGCAGAG GGACGGGTGG GGGGGGCGGG 42 0 

AAGGAGGCGG CGGCGGCTGG GGGCGGGGGA GGGAGGGGTA GAGGGGGGAG GGAAGGGGGC 4 80 

GGAGGCGGGA GGCCTTGCGG GAGGCGGCGA GCCCTGGGCA CATTCGCTCG CTGATCGGCG 54 0 

CACAGAGGAT CTTGTCCCCG AGCTGCGCCA GCAGAGCCAG CCGGGCGCCT CGCTCGGTCC 600 

GCTCGCCGCG CCGGAGAGAG CTGCCTGAGA CGCAGCCAGC CAGCCAGCCG GCGCCACGCC 66 0 

GCCGAGCGCT CGGCCCCGGA GTCCCTGAGT GCGGCGCGGC GAGCCCCCAG CGGCGGCAGA 72 0 

AGGACTCGAG ATC 73 3 
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CLAIMS 

1 . An isolated nucleic acid molecule selected from the group consisting of 

(a) nucleic acid molecules which hybridize under stringent conditions to a molecule 
consisting of the nucleic acid sequence set forth in SEQ ID NO: 3 or SEQ ID NO: 5 and which 

5 code for a polypeptide which inhibits TGF-P superfamily signaling, 

(b) nucleic acid molecules that differ from the nucleic acid molecules of (a) in codon 
sequence due to the degeneracy of the genetic code, and 

(c) complements of (a) and (b). 

10 2. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule 
comprises SEQ ID NO:7 or SEQ ID NO:8. 

3. The isolated nucleic acid molecule of claim 2, wherein the isolated nucleic acid molecule 
consists of SEQ ID NO:7. 

15 

4. The isolated nucleic acid molecule of claim 2, wherein the isolated nucleic acid molecule 
consists of SEQ ID NO:8. 

5. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule 
20 consists of SEQ ID NO:3. 

6. The isolated nucleic acid molecule of claim 1 , wherein the isolated nucleic acid molecule 
consists of SEQ ID NO:5. 

25 7. An isolated nucleic acid molecule selected from the group consisting of (a) a unique 

fragment of SEQ ID NO: 3 between 12 and 1944 nucleotides in length, (b) a unique fragment of 
SEQ ID NO: 5 between 12 and 1875 nucleotides in length, (c) complements of "(a)" and (d) 
complements of "(b)", provided that the nucleic acid molecule excludes sequences consisting of 
SEQ ID NO:l and SEQ ID NO:2. 

30 

8. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 14 contiguous nucleotides. 
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9. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 15 contiguous nucleotides. 

10. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
5 consists of at least 16 contiguous nucleotides. 

1 1 . The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 1 7 contiguous nucleotides. 

10 12. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 1 8 contiguous nucleotides. 

13. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 20 contiguous nucleotides. 

15 

14. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 22 contiguous nucleotides. 

15. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
20 consists of between 12 and 32 contiguous nucleotides. 

16. An expression vector comprising the isolated nucleic acid molecule of claims 1, 2, 3, 4, 5, 
6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 operably linked to a promoter. 

25 17. A host cell transformed or transfected with the expression vector of claim 16. 

17B. A transgenic non-human animal comprising the expression vector of claim 16. 

17B2. The transgenic non-human animal of claim 17B, comprising a conditional Smad7 
30 expression vector. 

17C. A transgenic non-human animal which has reduced expression of Smad7 nucleic 
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acid molecules. 

17D. The transgenic non-human animal of claim 17C, comprising a Smad7 gene 
disrupted by homologous recombination. 

5 

17E. The transgenic non-human animal of claim 17C, comprising a conditional Smad7 
gene disruption. 

17F. The transgenic non-human animal of claim 17C, comprising a /raws-acting negative 
1 0 regulator of Smad7 expression. 

1 8. An isolated polypeptide encoded by the isolated nucleic acid molecule of claim 1,2,3,4, 
5 or 6, wherein the polypeptide has TGF-P superfamily signaling inhibitory activity. 

15 19. The isolated polypeptide of claim 1 8, wherein the isolated polypeptide consists of a 
molecule comprising the amino acid sequence of SEQ ID NO: 4 or SEQ ID NO:6. 

20. The isolated polypeptide of claim 18, wherein the isolated polypeptide consists of a 
functional fragment or variant. 

20 

21 . The isolated polypeptide of claim 20, wherein the isolated polypeptide consists of amino 
acids 204-426 of SEQ ID NO:4 or 6. 

2 IB. The isolated polypeptide of claim 20, wherein the isolated polypeptide consists of amino 
25 acids 2-261 of SEQ ID NO:4 or 6. 

22. An isolated polypeptide which binds selectively a polypeptide encoded by the isolated 
nucleic acid molecule of claim 1, 2, 3, 4, 5 or 6, provided that the isolated polypeptide is not a 
TGFP superfamily type I receptor. 

30 

23. The isolated polypeptide of claim 22, wherein the isolated polypeptide binds to an 
epitope defined by a polypeptide consisting of the sequence of SEQ ID NOs:4 or 6. 
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23B. The isolated polypeptide of claim 22, wherein the polypeptide is an antibody or fragment 
thereof which selectively binds an epitope defined by a polypeptide selected from the group 
consisting of SEQ ID NO:l 1, SEQ ID NO:12, SEQ ID NO:13 5 and SEQ ID NO:14. 

5 24. The isolated polypeptide of claim 22, wherein the isolated polypeptide is an antibody 
fragment selected from the group consisting of a Fab fragment, a F(ab) 2 fragment or a fragment 
including a CDR3 region selective for a Smad7 polypeptide. 

24B. The isolated polypeptide of claim 22, wherein the isolated polypeptide is a monoclonal 
10 antibody, a humanized antibody or a chimeric antibody. 

25. An isolated complex of polypeptides comprising: 

a polypeptide as claimed in claim 1 8 bound to a receptor selected from the group 
consisting of activated TGFp superfamily type I receptors and complexes of TGFp superfamily 
15 type I receptors and TGFp superfamily type II receptors. 

25B. The isolated complex of claim 25, wherein the receptor is selected from the group 
consisting of TpRI, BMPR-IA, BMPR-IB, ActR-IA, a complex of TpRI and TpRII, a complex 
of BMPR-IA and BMPR-II, a complex of BMPR-IB and BMPR-II, a complex of ActR-IA and 
20 BMPR-II and a complex of ActR-IA and ActR-II. 

26. The isolated complex of polypeptides of claim 25, wherein the polypeptide as claimed in 
claim 1 consists of the amino acid sequence of SEQ ID NO:6. 

25 27. A method for reducing TGF-P superfamily signal transduction in a mammalian cell, 
comprising 

contacting the mammalian cell with an amount of an agent which increases the amount of 
Smad7 or a fragment thereof in the cell effective to reduce TGF-P superfamily signal 
transduction in the mammalian cell. 

30 

28. The method of claim 27, wherein the TGF-p superfamily signal transduction is signal 
transduction mediated by a TGFp superfamily ligand. 
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28B. The method of claim 28, wherein the TGFp superfamily ligand is selected from the group 
consisting of TGF-pl, activin, Vgl, BMP-4 and BMP-7. 

29. The method of claim 28, wherein the agent is selected from the group consisting of a 
5 nucleic acid encoding the amino acid sequence of SEQ ID NO:4, a nucleic acid encoding the 

amino acid sequence of SEQ ID NO: 6, a nucleic acid encoding amino acids 204-426 of SEQ ID 
NO:4, a nucleic acid encoding amino acids 204-426 of SEQ ID NO:6, a nucleic acid encoding 
amino acids 2-261 of SEQ ID NO:4, a nucleic acid encoding amino acids 2-261 of SEQ ID 
NO: 6, and a polypeptide as claimed in claim 18. 

10 

30. The method of claim 28, wherein the agent is selected from the group consisting of 
activin, epidermal growth factor and phorbol esters. 

31. A method for reducing phosphorylation of a pathway specific Smad polypeptide in a 
15 mammalian cell, comprising 

contacting the mammalian cell with an amount of the isolated nucleic acid of claim 1 or 
an isolated Smad polypeptide expression product thereof, effective to reduce phosphorylation of 
the pathway specific Smad polypeptide in the mammalian cell. 

20 3 IB. The method of claim 3 1 , wherein the pathway specific Smad polypeptide is selected from 
the group consisting of Smad 1, Smad2, Smad3 and Smad5. 

31C. The method of claim 31, wherein the isolated Smad polypeptide is selected from the 
group consisting of Smad7, Smad8, a N-terminal fragment of Smad7 and a C-terminal fragment 
25 ofSmad7. 

32. A method for modulating the proliferation of a cell, the proliferation of which is altered 
by TGF-P superfamily signal transduction, comprising 

contacting the cell with an amount of an isolated nucleic acid of claim 1 or an isolated 
30 Smad polypeptide expression product thereof, effective to modulate the proliferation of the cell 



33. A method of increasing TGF-P superfamily signal transduction in a cell comprising 
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contacting the cell with an amount of an agent that selectively binds to an isolated nucleic 
acid molecule of claim 1 or an isolated Smad polypeptide expression product thereof, effective to 
increase the TGF-P superfamily signal transduction in the cell. 

5 34. The method of claim 33 wherein the TGF-P superfamily signal transduction is mediated 
by a TGFp superfamily ligand selected from the group consisting of TGF-pl, activin, Vgl, 
BMP-4 and BMP-7. 

35. The method of claim 34, wherein the agent is an antisense nucleic acid. 

10 

36. The method of claim 34, wherein the agent is a polypeptide. 

37. The method of claim 36, wherein the polypeptide is selected from the group consisting of 
(1) an antibody which binds a polypeptide selected from the group consisting of a polypeptide 

15 comprising the amino acid sequence of SEQ ID NO:4, a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 6, a polypeptide comprising the amino acid sequence of SEQ ID 
NO: 13, a polypeptide comprising the amino acid sequence of SEQ ID NO: 14, a N-terminal 
fragment of Smad7 and a C-terminal fragment of Smad7, and (2) a dominant negative variant of 
Smad7. 

20 

38. A composition comprising: 

the polypeptide of claim 18, and 

a pharmaceutically acceptable carrier. 

25 39. A method for decreasing Smad7 TGF-P superfamily inhibitory activity in a subject 
comprising 

administering to a subject in need of such treatment an agent that selectively binds to an 
isolated nucleic acid molecule of claim 1 or an expression product thereof, in an amount effective 
to decrease Smad7 TGF-P superfamily inhibitory activity in the subject. 

30 

40. The method of claim 39 wherein the TGF-p superfamily inhibitory activity is inhibition 
of TGF-P superfamily signal transduction mediated by a TGFp superfamily ligand selected from 
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the group consisting of TGF-P 1, activin, Vgl, BMP -4 and BMP-7. 

4L The method of claim 39, wherein the agent is an antisense nucleic acid. 

5 42. The method of claim 39, wherein the agent is a polypeptide. 

43. The method of claim 42, wherein the polypeptide is selected from the group consisting of 
(1) an antibody which binds a polypeptide selected from the group consisting of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:4, a polypeptide comprising the amino acid 
10 sequence of SEQ ID NO: 6, a polypeptide comprising the amino acid sequence of SEQ ID 
NO: 1 3, a polypeptide comprising the amino acid sequence of SEQ ID NO: 14, a N-terminal 
fragment of Smad7 and a C-terminal fragment of Smad7, and (2) a dominant negative variant of 
Smad7. 



15 44. A method for diagnosing induction of a TGF-p superfamily ligand in a cell, comprising: 

(a) measuring the amount of Smad7 RNA or polypeptide in the cell, and 

(b) comparing the result of (a) with a control. 



45. A method for determining the presence of a functional TGFp superfamily receptor in a 
20 cell, comprising: 

(a) contacting the cell with an amount of TGFp superfamily ligand effective to increase 
the amount of Smad7 in the cell, 

(b) measuring the amount of Smad7 RNA or polypeptide in the cell, and 

(c) comparing the result of (b) with a control, wherein an increased amount of Smad7 

25 RNA or polypeptide in the cell indicates the presence of a functional TGFp superfamily receptor. 

45B. The method of claim 45, wherein the TGFp superfamily receptor is selected from the 
group consisting of TGF-p superfamily type I receptors, TGF-P superfamily type II receptors, 
and complexes of TGF-p superfamily type I receptors and TGF-p superfamily type II receptors. 



46. A method for identifying lead compounds for a pharmacological agent useful in the 
diagnosis or treatment of disease associated with Smad7 TGF-P superfamily inhibitory activity, 
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comprising 

forming a mixture comprising a Smad7 polypeptide, a TGF-p superfamily receptor 
complex or an activated TGFp superfamily type I receptor, and a candidate pharmacological 
agent, 

5 incubating the mixture under conditions which, in the absence of the candidate 

pharmacological agent, permit a first amount of specific binding of the TGF-p superfamily 
receptor complex or activated TGFp superfamily type I receptor by the Smad7 polypeptide, and 

detecting a test amount of the specific binding of the TGF-P superfamily receptor 
complex or activated TGFp superfamily type I receptor by the Smad7 polypeptide, wherein 

10 reduction of the test amount of specific binding relative to the first amount of specific binding 
indicates that the candidate pharmacological agent is a lead compound for a pharmacological 
agent which disrupts the Smad7 TGF-P superfamily signal transduction inhibitory activity, and 
wherein increase of the test amount of specific binding relative to the first amount of specific 
binding indicates that the candidate pharmacological agent is a lead compound for a 

15 pharmacological agent which enhances the Smad7 TGF-p superfamily signal transduction 
inhibitory activity. 

47. The method of claim 46, wherein members of the TGF-p superfamily are selected from 
the group consisting of TGF-P 1, activin, Vgl, BMP-4 and BMP-7. 

20 

47B. The method of claim 46, wherein the Smad7 polypeptide is selected from the group 
consisting of a polypeptide comprising the amino acid sequence of SEQ ID NO:4, a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 6, a N-terminal fragment of Smad7 and a C- 
terminal fragment of Smad7. 

25 

48. A method for identifying lead compounds for a pharmacological agent useful in the 
diagnosis or treatment of disease associated with Smad7 TGF-p superfamily signal transduction 
inhibitory activity, comprising 

forming a mixture comprising a Smad7 polypeptide, a TGF-p superfamily receptor 
30 complex or an activated TGF-p superfamily type I receptor, a pathway specific Smad 
polypeptide and a candidate pharmacological agent, 

incubating the mixture under conditions which, in the absence of the candidate 
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pharmacological agent, permit a first amount of TGF-P superfamily induced phosphorylation of 
the pathway specific Smad polypeptide, and 

detecting a test amount of TGF-P superfamily induced phosphorylation of the pathway 
specific Smad polypeptide, wherein reduction of the test amount of the phosphorylation of the 
5 pathway specific Smad polypeptide relative to the first amount of the phosphorylation of the 
pathway specific Smad polypeptide indicates that the candidate pharmacological agent is a lead 
compound for a pharmacological agent which enhances the TGF-P superfamily signal 
transduction inhibitory activity of Smad7 and wherein increase of the test amount of the 
phosphorylation of the pathway specific Smad polypeptide relative to the first amount of the 
10 phosphorylation of the pathway specific Smad polypeptide indicates that the candidate 

pharmacological agent is a lead compound for a pharmacological agent which disrupts the TGF- 
p superfamily signal transduction inhibitory activity of Smad7. 

48B. The method of claim 48, wherein the pathway specific Smad polypeptide is selected from 
15 the group consisting of Smadl, Smad2, Smad3 and Smad5. 

48C. The method of claim 48, wherein the Smad7 polypeptide is selected from the group 
consisting of a polypeptide comprising the amino acid sequence of SEQ ID NO:4, a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 6, a N-terminal fragment of Smad7 and a C- 
20 terminal fragment of Smad7. 

49. The method of claim 48, wherein members of the TGF-P superfamily are selected from 
the group consisting of TGF-P 1, activin, Vgl, BMP-4 and BMP-7. 

25 50. A method for reducing expression of a Smad6 or Smad7 nucleic acid or expression 
product thereof in a cell, comprising 

contacting the cell with an amount of an agent which binds selectively to a Smad4 nucleic 
acid or polypeptide effective to reduce the expression of the Smad6 or Smad7 nucleic acid or 
expression product thereof in the cell. 

30 

5 1 . The method of claim 50, wherein the agent is an antisense Smad4 molecule. 



WO 98/53068 PCT/US98/10305 

- 86 - — 

52. The method of claim 50, wherein the agent is an antibody that selectively binds to 
Smad4. 

53. A method for increasing Smad6 or Smad7 expression in a cell, comprising 

5 contacting the cell with an agent selected from the group consisting of activin, epidermal 

growth factor and phorbol esters in an amount effective to increase Smad6 or Smad7 expression 
in the cell. 

54. A method for treating a subject having lung cancer characterized by elevated expression 
10 of a gene selected from the group consisting of Smad6 and Smad7, comprising, 

administering to the subject an amount of an agent which reduces the expression or 
activity of a nucleic acid molecule or an expression product thereof encoded by the gene. 

54B. The method of claim 54, wherein the agent selectively binds the nucleic acid molecule or 
1 5 an expression product thereof. 

54C. The method of claim 54B, wherein the agent is an antisense nucleic acid molecule 

54D. The method of claim 54B, wherein the agent is a polypeptide. 

20 

54E. The method of claim 54D, wherein the polypeptide is an antibody or fragment thereof 
which binds a polypeptide selected from the group consisting of a polypeptide comprising the 
amino acid sequence of SEQ ID NO:4, a polypeptide comprising the amino acid sequence of 
SEQ ID NO:6, a N-terminal fragment of Smad7 and a C-terminal fragment of Smad7, a 
25 polypeptide comprising the amino acid sequence of SEQ ID NO: 1 0, a polypeptide comprising 
the amino acid sequence of SEQ ID NO:l 1, a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 12, a polypeptide comprising the amino acid sequence of SEQ ID NO: 13, and a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 14. 

30 54F. The method of claim 54B, wherein the agent is a dominant negative variant of Smad6 or 
Smad7. 
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55. A method for reducing eye defects in a developing mammalian embryo, comprising 
contacting the cells of the embryo with an agent which reduces the expression or activity of a 
Smad7 nucleic acid molecule or an expression product thereof. 

5 56. The method of claim 55 , wherein the agent selectively binds the Smad7 nucleic acid 
molecule or an expression product thereof. 

57. The method of claim 56, wherein the agent is an antisense nucleic acid molecule 
10 58. The method of claim 56, wherein the agent is a polypeptide. 

59. The method of claim 58, wherein the polypeptide is an antibody or fragment thereof 
which binds a polypeptide selected from the group consisting of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 4, a polypeptide comprising the amino acid sequence of 

15 SEQ ID NO: 6, a polypeptide comprising the amino acid sequence of SEQ ID NO: 13, a 

polypeptide comprising the amino acid sequence of SEQ ID NO: 14, aN-terminal fragment of 
Smad7 and a C-terminal fragment of Smad7. 

60. The method of claim 56, wherein the agent is a dominant negative variant of Smad7. 

20 

61 . An isolated polypeptide which is localized in the nucleus of a cell having a TGFp 
superfamily receptor and which is exported from the nucleus to the cytoplasm of the cell upon 
TGFp superfamily receptor-mediated signal transduction, comprising 

a first polypeptide or fragment thereof linked to a second polypeptide or fragment thereof, 
25 wherein the second polypeptide or fragment thereof comprises a MH2 domain of Smad7. 

6 IB. An fusion protein comprising a Smad7 MH2 domain or a nuclear localization fragment 
thereof. 

30 62. A fusion protein comprising a Smad7 MH2 domain or a transcriptional activation 
fragment thereof. 
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63. A method for modulating transcription of Smad7-regulated gene transcription, 
comprising 

contacting a mammalian cell with an agent which modulates TGFp superfamily receptor- 
mediated signal transduction in an amount effective to modulate Smad7 -regulated gene 
5 transcription. 

64. The method of claim 63, wherein the agent is a TGFp superfamily ligand or an inhibitor 
of TGFp superfamily receptor-mediated signal transduction. 

10 65. An isolated nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO: 15, an allelic variant thereof, or a functional fragment thereof which confers TGFP 
regulation. 

66. An isolated nucleic acid molecule comprising a unique fragment of SEQ ID NO:15. 

15 

67. An expression vector comprising the isolated nucleic acid molecule of claim 65 or 
claim 66. 

68. A method for regulating transcription of a first nucleic acid molecule, comprising 
20 preparing a construct comprising the first nucleic acid molecule operably linked to 

the isolated nucleic acid molecule of claim 65 or claim 66, and 
introducing the construct into an expression system. 

69. The method of claim 68, wherein the expression system is a cell. 

25 

70. The method of claim 69, wherein the cell expresses a TGFp receptor, further 
comprising contacting the cell with TGFp to increase expression of the first nucleic acid 
molecule. 



30 



71. A method for identifying modulators of TGFp-regulated transcriptional activity, 
comprising 

providing an expression system with a reporter construct including SEQ ID NO: 1 5 or a 
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TGFp-regulated fragment thereof operably linked to a nucleic acid encoding a detectable 

expression product, 

contacting the expression system with a candidate modulator compound, 
incubating the expression system under conditions which, in the absence of the candidate 
5 modulator, permit a first amount of expression of the detectable expression product, and 

detecting a test amount of the expression the detectable expression product, wherein 

detection of an increase in the expression of the detectable expression product in the presence of 

the candidate modulator compound indicates that the candidate modulator compound is a 

compound which increases TGFp-regulated transcriptional activity, and wherein detection of a 
10 decrease in the expression of the detectable expression product in the presence of the candidate 

modulator compound indicates that the candidate modulator compound is a compound which 

decreases TGFp-regulated transcriptional activity. 

72. The method of claim 71, wherein the expression system is a selelcted from the group 
15 consisting of cells and in vitro cell-free transcription systems. 



73. The method of claim 71, wherein the detectable expression product is a reporter protein. 
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SMAD7 AND USES THEREOF 

Field of the Invention 

This invention relates to nucleic acids and encoded polypeptides which interact with 
5 TGF-p superfamily receptors and which are negative regulators of signaling by those receptors. 
The invention also relates to agents which bind the nucleic acids or polypeptides. The invention 
further relates to methods of using such nucleic acids and polypeptides in the treatment and/or 
diagnosis of disease. 

10 Background of the Invention 

During mammalian embryogenesis and adult tissue homeostasis transforming growth 
factor P (TGF-p) performs pivotal tasks in intercellular communication (Roberts et al., 1993). 
The cellular effects of this pleiotropic factor are exerted by ligand-induced hetero- 
oligomerization of two distantly related type I and type II serine/threonine kinase receptors, 

15 TpR-I and TpR-II, respectively (Lin and Lodish, 1993; Derynck, 1994; Massague and Weis- 
Garcia, 1996; ten Dijke et al, 1996). The two receptors, which both are required for signaling, 
act in sequence; TpR-I is a substrate for the constitutively active TpR-II kinase (Wrana et al, 
1994; Weiser et al., 1995). TGF-P forms part of a large family of structurally related proteins 
which include activins and bone morphogenetic proteins (BMPs) that signal in a similar fashion, 

20 each employing distinct complexes of type I and type II serine/threonine kinase receptors (Lin 
and Lodish, 1993; Derynck, 1994; Massague and Weis-Garcia, 1996; ten Dijke et ah, 1996). 

Genetic studies of TGF-p-like signalling pathways in Drosophila and Caenorhabditis 
elegans have led to the identification of mothers against dpp (Mad) (Sekelsky et al, 1995) and 
sma (Savage et al, 1996) genes, respectively. The products of these related genes perform 

25 essential functions downstream of TGF-p-like ligands acting via serine/threonine kinase 
receptors in these organisms (Wiersdorf et al, 1996; Newfeld et al., 1996; Hoodless et al, 
1 996). Vertebrate homologs of Mad and sma have been termed Smads (Derynck et al, 1996) or 
MADR genes (Wrana and Attisano, 1996). Genetic alterations in Smad2 and Smad4/DPC4 have 
been found in specific tumor subsets, and thus Smads may function as tumor suppressor genes 

30 (Hahn et al, 1996; Riggins et al, 1996; Eppert et al, 1996). Smad proteins share two regions of 
high similarity, termed MH1 and MH2 domains, connected with a variable proline-rich sequence 
(Massague, 1996; Derynck and Zhang, 1996). The C-terminal part of Smad2, when fused to a 
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heterologous DNA-binding domain, was found to have transcriptional activity (Liu et al. 9 1996; 
Meersseman et aL, 1997). The intact Smad2 protein when fused to a DNA-binding domain, was 
latent, but transcriptional activity was unmasked after stimulation with ligand (Liu et aL, 1996). 

Different Smads specify different responses using functional assays in Xenopus. Whereas 
5 Smadl induces ventral mesoderm, a BMP-like response, Smad2 induces dorsal mesoderm, an 
activin/TGF-p-like response (Graff et aL, 1996; Baker and Harland, 1996; Thomsen, 1996). 
Upon ligand stimulation Smads become phosphorylated on serine and threonine residues; BMP 
stimulates Smadl phosphorylation, whereas TGF-0 induces Smad2 and Smad3 phosphorylation 
(Hoodless et aL, 1996; Liu et aL, 1996; Eppert et aL, 1996; Lechleider et aL, 1996; Yingling et 

10 a/., 1996; Zhang et aL, 1996; Macias-Silva et aL, 1996; Nakao et aL, 1996). Thus certain Smads 
are pathway specific. Pathway specific Smads include Smadl, Smad2, Smad3 and Smad5. 

Smad4 is a common component of TGF-p, activin and BMP signaling (Lagna et aL, 
1996; Zhang et aL, 1997; de Winter et aL, 1997). Smad4 phosphorylation has thus far been 
reported only after activin stimulation of transfected cells (Lagna et aL 9 1996). After stimulation 

15 with TGF-P or activin Smad4 interacts with Smad2 or Smad3, and upon BMP challenge a 

heteromeric complex of Smad4 and Smadl has been observed (Lagna et aL, 1996). Upon ligand 
stimulation, Smad complexes translocate from the cytoplasm to the nucleus (Hoodless et aL, 
1996; Liu et aL, 1996; Baker and Harland, 1996; Macias-Silva et al, 1996), where they, in 
combination with DNA-binding proteins, may regulate gene transcription (Chen et aL, 1996). 

20 

Summary of the Invention 

The invention provides isolated Smad7 nucleic acid molecules, unique fragments of those 
molecules, expression vectors containing the foregoing, and host cells transfected with those 
molecules. The invention also provides isolated polypeptides encoded by the Smad7 nucleic 

25 acids and agents which bind such polypeptides, including antibodies. The foregoing can be used 
in the diagnosis or treatment of conditions characterized by the expression of a Smad7 nucleic 
acid or polypeptide, or lack thereof. The invention also provides methods for identifying 
pharmacological agents useful in the diagnosis or treatment of such conditions. Here, we present 
the identification of Smad7, which opposes pathway specific Smads including Smadl, Smad2 

30 and Smad3 and thus is an inhibitor of the TGF-P superfamily signalling pathways. 

According to one aspect of the invention, an isolated nucleic acid molecule is provided. 
The molecule hybridizes under stringent conditions to a molecule consisting of the nucleic acid 
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sequence of SEQ ID NOs:3 or 5. The isolated nucleic acid molecule codes for a polypeptide 
which inhibits TGF-P superfamily signaling. The invention further embraces nucleic acid 
molecules that differ from the foregoing isolated nucleic acid molecules in codon sequence due 
to the degeneracy of the genetic code. The invention also embraces complements of the 
5 foregoing nucleic acids. 

In certain embodiments, the isolated nucleic acid molecule comprises a molecule 
consisting of the nucleic acid sequence of SEQ ID NO: 7 or 8. Preferably, the isolated nucleic 
acid molecule consists of the nucleic acid sequence of SEQ ID NO:7 or 8, or consists essentially 
of the nucleic acid sequence of SEQ ID NO: 3 or 5. 

10 According to another aspect of the invention, an isolated nucleic acid molecule is 

provided. The isolated nucleic acid molecule comprises a molecule consisting of a unique 
fragment of SEQ ID NO:3 between 12 and 1944 nucleotides in length and complements thereof, 
or a unique fragment of SEQ ID NO:5 between 12 and 1875 nucleotides in length and 
complements thereof, provided that the isolated nucleic acid molecule excludes sequences 

15 consisting only of SEQ ID NO:l and SEQ ID NO:2. Preferably the isolated nucleic acid 

molecule excludes molecules consisting solely of nucleotide sequences selected from the group 
consisting of accession numbers AA061644 (SEQ ID NO:l), AA022262 (SEQ ID NO:2), 
AA347307, AA348247, 321995, W78627, W40869, AA033426, AA397050, AA016891, and 
C851 15. In one embodiment, the isolated nucleic acid molecule consists of between 12 and 32 

20 contiguous nucleotides of SEQ ID NO:3, SEQ ID NO:5, or complements of such nucleic acid 
molecules. In preferred embodiments, the unique fragment is at least 14, 15, 16, 17, 18, 20 or 22 
contiguous nucleotides of the nucleic acid sequence of SEQ ID NO:3, SEQ ID NO:5, SEQ ID 
NO:7, SEQ ID NO:8, or complements thereof. 

According to another aspect of the invention, the invention involves expression vectors, 

25 and host cells transformed or transfected with such expression vectors, comprising the nucleic 
acid molecules described above. 

According to still other aspects of the invention, transgenic non-human animals are 
provided. The animals include in certain embodiments the foregoing expression vectors. In 
certain preferred embodiments, the transgenic non-human animal includes a conditional Smad7 

30 expression vector, such as an expression vector that increases expression of Smad7 in a tissue 
specific, development stage specific, or inducible manner. In other embodiments, the transgenic 
non-human animal has reduced expression of Smad7 nucleic acid molecules. In some 
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embodiments, the transgenic non-human animal includes a Smad7 gene disrupted by 
homologous recombination. The disruption can be homozygous or heterozygous. In other 
embodiments, the transgenic non-human animal includes a conditional Smad7 gene disruption, 
such as one mediated by e.g. tissue specific, development stage specific, or inducible, expression 
5 of a recombinase. In yet other embodiments, the transgenic non-human animal includes a trans- 
acting negative regulator of Smad7 expression, such as antisense Smad7 nucleic acid molecules, 
nucleic acid molecules which encode dominant negative Smad7 proteins, Smad7 directed 
ribozymes , etc. 

According to another aspect of the invention, an isolated polypeptide is provided. The 
10 isolated polypeptide is encoded by the isolated nucleic acid molecule of any of claims 1, 2, 3, 4, 
5 or 6, and the polypeptide inhibits TGF-P superfamily signaling activity. 

In other embodiments, the isolated polypeptide consists of a fragment or variant of the 
foregoing which retains the activity of the foregoing. In preferred embodiments, the fragment is 
a C-terminal fragment of Smad7, preferably amino acids 204-426 of SEQ ID NO:4 or SEQ ID 
15 NO:6, or a N-terminal fragment of Smad7, preferably amino acids 2-261 of SEQ ID NO:4 or 
SEQ ID NO:6. 

According to another aspect of the invention, there are provided isolated polypeptides 
which selectively bind a Smad7 protein or fragment thereof, provided that the isolated 
polypeptide is not a TGF-p superfamily type I receptor (e.g., a TGF-p, activin or BMP receptor). 

20 The isolated polypeptide in certain embodiments binds to a polypeptide encoded by the isolated 
nucleic acid molecule of any of claims 1, 2, 3, 4, 5 or 6. In preferred embodiments, isolated 
binding polypeptides include antibodies and fragments of antibodies (e.g., Fab, F(ab) 2 , Fd and 
antibody fragments which include a CDR3 region which binds selectively to an epitope defined 
by the Smad7 polypeptides of the invention, such as SEQ ID NOs:4 or 6). In other preferred 

25 embodiments, the polypeptide is an antibody or fragment thereof which selectively binds an 
epitope defined by a polypeptide selected from the group consisting of SEQ ID NO: 1 1 , SEQ ID 
NO: 12, SEQ ID NO:13, and SEQ ID NO:14. In still other preferred embodiments, the isolated 
polypeptide is a monoclonal antibody, a humanized antibody or a chimeric antibody. 

30 The invention provides in another aspect an isolated complex of polypeptides. The 

isolated complex includes a TGF-p superfamily receptor selected from the group consisting of 
activated TGFp superfamily type I receptors and complexes of TGFp superfamily type I 
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receptors and TGFp superfamily type II receptors (e.g. a TGF-P, activin, Vgl or BMP receptors) 
bound to a polypeptide as claimed in claim 1. Preferably the isolated complex includes a 
polypeptide having the amino acid sequence of SEQ ID NO:4 or SEQ ID NO:6. In other 
preferred embodiments, the receptor is selected from the group consisting of TpRI, BMPR-IA, 
5 BMPR-IB, ActR-IA, a complex of TpRI and TpRII, a complex of BMPR-IA and BMPR-II, a 
complex of BMPR-IB and BMPR-II, a complex of ActR-IA and BMPR-II and a complex of 
ActR-IA and ActR-IL 

According to still another aspect of the invention, methods for reducing TGF-P 
superfamily signal transduction in a mammalian cell are provided. The methods involve 

10 contacting a mammalian cell with an amount of an inhibitor of TGF-P superfamily signal 

transduction effective to reduce such signal transduction in the mammalian cell. Preferably the 
TGF-p superfamily signal transduction is mediated by a TGFp superfamily ligand, particularly 
TGF-pl, activin, Vgl, BMP-4 and/or BMP-7. Other methods are provided for reducing 
phosphorylation of pathway specific Smads (e.g. Smadl, Smad2, Smad3 and/or Smad5) by 

15 contacting a mammalian cell with the inhibitor disclosed above. In certain embodiments of the 
foregoing methods, the inhibitor is an isolated Smad7 polypeptide or a fragment thereof, such as 
a polypeptide encoded by a nucleic acid which hybridizes under stringent conditions to SEQ ID 
NO:3 or 5, nucleic acids which encode the polypeptide of SEQ ID NO:4 or SEQ ID NO:6 or 
degenerates or complements thereof. In some embodiments the nucleic acid encodes amino 

20 acids 204-426 of SEQ ID NO:4 or SEQ ID NO:6 or amino acids 2-261 of SEQ ID NO:4 or SEQ 
ID NO:6. In still other embodiments, the inhibitor is an isolated Smad8 polypeptide or a 
fragment thereof 

According to still another aspect of the invention, methods for modulating proliferation 
and/or differentiation of a cell are provided. The methods involve contacting a cell with an 
25 amount of an isolated Smad7 polypeptide,or a nucleic acid encoding and expressing such a 

polypeptide, as described above, effective to modulate the proliferation and/or differentiation of 
the cell. 

The invention in a further aspect provides methods for increasing TGF-p superfamily 
signal transduction in a mammalian cell. The mammalian cell is contacted with an agent that 
30 selectively binds to an isolated nucleic acid molecule of the invention or an expression product 
thereof in an amount effective to increase TGF-P superfamily signal transduction. Preferably the 
TGF-P superfamily signal transduction is mediated by a TGFp superfamily ligand selected from 
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the group consisting of TGF-pl, activin, Vgl, BMP-4 and BMP-7. Preferred agents are 
antisense nucleic acids, including modified nucleic acids, and polypeptides including antibodies 
which bind to the polypeptide including the amino acids of SEQ ID NO:4 , the polypeptide 
including the amino acids of SEQ ID NO: 6, the polypeptide including the amino acids of SEQ 
5 ID NO: 1 3, the polypeptide including the amino acids of SEQ ID NO: 14, a N-terminal fragment 
of Smad7 or a C-terminal fragment of Smad7, and dominant negative variants of the polypeptide 
of SEQ ID NO:4 or SEQ ID NO:6. 

The invention in still another aspect provides compositions comprising a Smad7 
polypeptide and a pharmaceutically acceptable carrier. 

10 The invention in a further aspect involves methods for decreasing Smad7 TGF-P 

superfamily inhibitory activity in a subject. An agent that selectively binds to an isolated nucleic 
acid molecule of the invention or an expression product thereof is administered to a subject in 
need of such treatment, in an amount effective to decrease TGFp superfamily signal transduction 
inhibitory activity of Smad7 in the subject. Preferably the TGFp superfamily signal transduction 

15 is mediated by a TGFp superfamily ligand selected from the group consisting of TGF-pl, 
activin, Vgl, BMP-4 and BMP-7. Preferred agents are antisense nucleic acids, including 
modified nucleic acids, and polypeptides including antibodies which bind to the polypeptide 
including the amino acids of SEQ ID NO:4 , the polypeptide including the amino acids of SEQ 
ID NO: 6, the polypeptide including the amino acids of SEQ ID NO: 13, the polypeptide including 

20 the amino acids of SEQ ID NO: 14, a N-terminal fragment of Smad7 or a C-terminal fragment of 
Smad7, and dominant negative variants of the polypeptide of SEQ ID NO:4 or SEQ ID NO:6. 

In another aspect the invention provides methods for diagnosing induction of a TGF-p 
superfamily ligand in a cell. The methods include the steps of (a) measuring the amount of 
Smad7 RNA or polypeptide in the cell and comparing the result of step (a) with a control. 

25 According to still another aspect of the invention, methods are provided for determining 

the presence of a functional TGFp superfamily receptor in a cell. The methods include 
contacting the cell with an amount of TGFp superfamily ligand effective to increase the amount 
of Smad7 in the cell, measuring the amount of Smad7 RNA or polypeptide in the cell, and 
comparing the result of the measurement with a control, wherein an increased amount of Smad7 

30 RNA or polypeptide in the cell indicates the presence of a functional TGFp superfamily receptor. 
Preferably the TGFp superfamily receptor is selected from the group consisting of TGFp 
superfamily type I receptors, TGFp superfamily type II receptors, and complexes of TGFp 
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superfamily type I receptors and TGFp superfamily type II receptors. 

According to another aspect of the invention, methods are provided for identifying lead 
compounds for a pharmacological agent useful in the diagnosis or treatment of disease associated 
with TGFP superfamily signal transduction inhibitory activity of Smad7. One set of methods 
5 involves forming a mixture of a Smad7 polypeptide, a TGF-P superfamily receptor complex or 
an activated TGFp superfamily type I receptor, and a candidate pharmacological agent. The 
mixture is incubated under conditions which, in the absence of the candidate pharmacological 
agent, permit a first amount of specific binding of the TGF-P superfamily receptor complex or 
activated TGFP superfamily type I receptor by the Smad7 polypeptide. A test amount of the 

10 specific binding of the TGF-P superfamily receptor complex or activated type I receptor by the 
Smad7 polypeptide then is detected. Detection of an increase in the foregoing activity in the 
presence of the candidate pharmacological agent indicates that the candidate pharmacological 
agent is a lead compound for a pharmacological agent which increases the TGF-P superfamily 
signal transduction inhibitory activity of Smad7. Detection of a decrease in the foregoing 

15 activities in the presence of the candidate pharmacological agent indicates that the candidate 
pharmacological agent is a lead compound for a pharmacological agent which decreases the 
TGF-P superfamily signal transduction inhibitory activity of Smad7. Another set of methods 
involves forming a mixture as above, adding further a pathway specific Smad polypeptide, and 
detecting first and test amounts of TGF-P superfamily induced phosphorylation of the pathway 

20 specific Smad polypeptide. Detection of an increase in the phosphorylation in the presence of 
the candidate pharmacological agent indicates that the candidate pharmacological agent is a lead 
compound for a pharmacological agent which decreases the TGF-P superfamily signal 
transduction inhibitory activity of Smad7. Detection of a decrease in the foregoing activities in 
the presence of the candidate pharmacological agent indicates that the candidate pharmacological 

25 agent is a lead compound for a pharmacological agent which increases the TGF-P superfamily 
signal transduction inhibitory activity of Smad7. Preferred Smad7 polypeptides include the 
polypeptides of claim 18. 

In the foregoing compositions and methods, preferred members of the TGF-P 
superfamily are TGF-pl, activin, Vgl, BMP-4 and BMP-7, and the preferred pathway specific 

30 Smad polypeptides are Smadl, Smad2, Smad3 and Smad5. 

According to yet another aspect of the invention, a method for reducing expression of a 
Smad6 or Smad7 nucleic acid or expression product thereof in a cell is provided. The method 
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includes contacting the cell with an amount of an agent which binds selectively to Smad4 
effective to reduce the expression of the Smad6 or Smad7 nucleic acid or expression product 
thereof in the cell. In certain embodiments, the agent is an antisense Smad4 molecule, or an 
antibody that selectively binds to Smad4. 
5 In another aspect, the invention provides a method for increasing Smad6 or Smad7 

expression in a cell. The method includes contacting the cell with an agent selected from the 
group consisting of activin, epidermal growth factor and phorbol esters in an amount effective to 
increase Smad6 or Smad7 expression in the cell. 

According to another aspect of the invention, a method for treating a subject having lung 

10 cancer characterized by elevated expression of a Smad6 gene or a Smad7 gene is provided. The 
method includes administering to the subject an amount of an antisense nucleic acid which binds 
to the expression product of the Smad 6 or Smad7 gene effective to reduce the expression of the 
Smad 6 or Smad7 gene. In other embodiments, the method includes administering a 
polypeptide, such as an antibody or fragment thereof which binds a polypeptide selected from the 

15 group consisting of a polypeptide comprising the amino acid sequence of SEQ ID NO:4, a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 6, a N -terminal fragment of 
Smad7 and a Oterminal fragment of Smad7, a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 1 0, a polypeptide comprising the amino acid sequence of SEQ ID NO: 1 1 , a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 12, a polypeptide comprising 

20 the amino acid sequence of SEQ ID NO: 13, and a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 14. In still other embodiments, the agent is a dominant negative variant 
of Smad6 or Smad7. 

According to yet another aspect of the invention, methods for reducing eye defects in a 
developing mammalian embryo are provided. The methods include contacting the cells of the 

25 embryo with an agent which reduces the expression or activity of a Smad7 nucleic acid molecule 
or an expression product thereof. In certain embodiments, the agent selectively binds the Smad7 
nucleic acid molecule or an expression product thereof. In preferred embodiments, the agent is 
an antisense nucleic acid molecule or a polypeptide, and preferably the polypeptide is an 
antibody or fragment thereof which binds a polypeptide selected from the group consisting of a 

30 polypeptide comprising the amino acid sequence of SEQ ID NO:4, a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 6, a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 13, a polypeptide comprising the amino acid sequence of SEQ ID NO: 14, a N- 
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terminal fragment of Smad7 and a C -terminal fragment of Smad7. The agent also can be a 
dominant negative variant of Smad7. 

According to another aspect of the invention, an isolated polypeptide is provided. The 
polypeptide includes a first polypeptide or fragment thereof linked to a second polypeptide or 
5 fragment thereof, wherein the second polypeptide or fragment thereof comprises a MH2 domain 
of Smad7. The polypeptide is localized in the nucleus of a cell and is exported from the nucleus 
to the cytoplasm of the cell upon TGFP superfamily receptor-mediated signal transduction in 
cells having a TGFp superfamily receptor. 

According to another aspect of the invention, a fusion protein is provided which includes 

10 a Smad7 MH2 domain or a nuclear localization fragment thereof. In other aspects of the 

invention, a fusion protein is provided which includes a Smad7 MH2 domain or a transcriptional 
activation fragment thereof. 

According to still another aspect of the invention, methods for modulating transcription 
of Smad7-regulated gene transcription are provided. The methods include contacting a 

15 mammalian cell with an agent which modulates TGFp superfamily receptor-mediated signal 
transduction in an amount effective to modulate Smad7-regulated gene transcription. In some 
embodiments the agent is a TGFp superfamily ligand or an inhibitor of TGFp superfamily 
receptor-mediated signal transduction. 

In another aspect of the invention, an isolated nucleic acid molecule is provided. The 

20 nucleic acid molecule includes the nucleotide sequence of SEQ ID NO: 1 5, an allelic variant 
thereof, or a functional fragment thereof which confers TGFP regulation. Nucleic acid 
molecules which include the nucleotide sequences which hybridize under stringent conditions to 
SEQ ID NO: 15 also are provided. In still other aspects of the invention, isolated nucleic acid 
molecules which include a unique fragment of SEQ ID NO: 1 5 are provided. Expression vectors 

25 which include the foregoing isolated nucleic acid molecules including or related to SEQ ID 
NO: 15 also are provided 

According to another aspect of the invention, method for regulating transcription of a first 
nucleic acid molecule are provided. The methods include preparing a construct comprising the 
first nucleic acid molecule operably linked to the foregoing isolated nucleic acid molecules 

30 including or related to SEQ ID NO: 15, and introducing the construct into an expression system. 
In some embodiments the expression system is a cell. In preferred embodiments, the cell 
expresses a TGFp superfamily receptor, and the method includes contacting the cell with a TGFp 
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superfamily ligand to increase expression of the first nucleic acid molecule. 

In still other aspects of the invention, methods for identifying modulators of TGFp- 
regulated transcriptional activity are provided. The methods include providing an expression 
system with a reporter construct including SEQ ID NO: 15 or a TGFp-regulated fragment thereof 
5 operably linked to a nucleic acid encoding a detectable expression product, and contacting the 
expression system with a candidate modulator compound. The expression system is incubated 
under conditions which, in the absence of the candidate modulator, permit a first amount of 
expression of the detectable expression product. A test amount of the expression the detectable 
expression product then is detected. Detection of an increase in the foregoing activity in the 

10 presence of the candidate modulator compound indicates that the candidate modulator compound 
is a compound which increases TGFp-regulated transcriptional activity. Detection of a decrease 
in the foregoing activities in the presence of the candidate modulator compound indicates that the 
candidate modulator compound is a compound which decreases TGFP-regulated transcriptional 
activity. In preferred embodiments the expression system is a cell or an in vitro transcription 

15 system. Other other preferred embodiments, the detectable expression product is a reporter 
protein, such as an enzyme, e.g., luciferase, or a green fluorescent protein. 

The use of the foregoing compositions in the preparation of a medicament is also 
contemplated. 

These and other aspects of the invention will be described in further detail in connection 
20 with the detailed description of the invention. 

Brief Description of the Figures 

Fig. 1 depicts the cloning and tissue distribution of Smad7. 

Fig. 2 shows that Smad7 inhibits TGF-p and activin induced transcriptional responses. 
25 Fig. 3 demonstrates the association of Smad7 with the TGF-p receptor complex. 

Fig. 4 shows that Smad7 is not phosphorylated upon TpR-activation, but Smad7 
overexpression inhibits TGF-P-mediated Smad2 and Smad3 phosphorylation. 
Fig. 5 shows that Smad7 is an early response gene for TGF-P . 

Fig. 6 demonstrates the association of Smad7 with BMP and activin receptor complexes. 
30 Fig. 7 shows that Smad7 inhibits BMP- and activin-receptor mediated phosphorylation of 

Smadl or Smad5. 

Fig. 8 demonstrates that the C-terminal domain of Smad7 associates with the TGF-p 
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receptor complex and that Smad8 and the N-terminal and Oterminal domains of Smad7 inhibit 
TGF-P signalling. 

Fig. 9 shows a comparison of Smad6 and Smad7. A: comparison of the amino acid 
sequences. B: Pairwise alignment relationship between hSmadl through hSmad7 and hSmad9. 
5 C: Expression of Smad6 and Smad7 mRNA in lung carcinoma cell lines. 

Fig. 10 shows that TGF-B superfamily members induce Smad6 and Smad7 mRNA 

levels. 

Fig. 1 1 shows that anti-sense Smad7 mRNA potentiates TGF-P 1 transcriptional response. 
Fig. 12 depicts the effect of EGF and PMA on Smad6 and Smad7 mRNA expression. 
10 Fig. 13 shows that Smad7 inhibits TGF-p-mediated signaling responses more effectively 

than Smad8. 

Fig. 14 shows that overexpression of Smad7 in Xenopus embryos induces formation of a 
partial secondary axis and eye defects. 

Fig. 1 5 shows characterization of expression of Smad7 and Smad6 in stable pMEP-4- 
1 5 driven transfected clones . 

Fig. 16 shows that Smad7 inhibits TGF-pl -mediated growth inhibition (A-C) Effective 
of Smad7 on TGF-pl -induced growth inhibition in three independent clones (D) Effective of 
Smad6S and (E) Smad6L on TGF-P 1 -induced growth inhibition. 

Fig. 17 shows that Smad7 inhibits TGF-P 1 -mediated induction of early response genes. 
20 Fig. 1 8 shows the subcellular distribution of Smad7 in the absence or presence of TGF- 

P 1 receptor activation. 

Fig. 19 shows the subcellular distribution of wild-type Smad7 and Smad7 deletion 
mutants in the absence or presence of TGF-pl stimulation. 

Fig. 20 depicts the results of a luciferase assays showing TGF-P 1 inducibility of a Smad7 
25 promoter fragment. 

Fig. 21 depicts several restriction enzyme sites and putative binding sites for different 
transcription factors and Smad proteins in the ~725bp Bam Hl-Xho I minimal promoter fragment 
(SEQ ID NO: 15). 



30 



Brief Description of the Sequences 

SEQ ID NO:l is the nucleotide sequence of the mouse EST (accession number 
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AA061644) which has sequence similarity to N-terminal Smad domains. 

SEQ ID NO:2 is the nucleotide sequence of the mouse EST (accession number 
AA022262) which has sequence similarity to C-terminal Smad domains. 

SEQ ID NO:3 is the nucleotide sequence of the mouse Smad7 cDNA. 
5 SEQ ID NO:4 is the amino acid sequence of the mouse Smad7 protein. 

SEQ ID NO: 5 is the nucleotide sequence of the human Smad7 cDNA. 

SEQ ID NO:6 is the amino acid sequence of the human Smad7 protein. 

SEQ ID NO: 7 is the nucleotide sequence of the coding region of the mouse Smad7 

cDNA. 

10 SEQ ID NO: 8 is the nucleotide sequence of the coding region of the human Smad7 

cDNA. 

SEQ ID NO: 9 is the nucleotide sequence of the human Smad6 cDNA. 
SEQ ID NO: 10 is the amino acid sequence of a Smad7 preferred peptide to which an 
antibody can be raised. 

15 SEQ ID NO: 1 1 is the amino acid sequence of a Smad7 preferred peptide to which an 

antibody can be raised. 

SEQ ID NO: 12 is the amino acid sequence of a Smad7 preferred peptide to which an 
antibody can be raised. 

SEQ ID NO: 13 is the amino acid sequence of a Smad7 preferred peptide to which an 
20 antibody can be raised. 

SEQ ID NO: 14 is the amino acid sequence of a Smad7 preferred peptide to which an 
antibody can be raised. 

SEQ ID NO: 15 is the 725 nucleotide Bam Hl-Xho I Smad7 minimal promoter fragment. 

25 

Detailed Description of the Invention 

The present invention in one aspect involves the cloning of a cDNA encoding a Smad7 
T0R-I receptor-interacting protein. The sequence of the mouse gene is presented as 
SEQ ID NO:3, and the predicted amino acid sequence of this gene's protein product is presented 
30 as SEQ ID NO:4. The sequence of the human gene is presented as SEQ ID NO:5 ? and the 
predicted amino acid sequence of this gene's protein product is presented as SEQ ID NO:6. 
Analysis of the sequence by comparison to nucleic acid and protein databases determined that 
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Smad7 has a C-terminal domain (the MH2 domain) which is related to other Smad proteins. The 
Smad7 C-terminal domain is most related to Smad6 (48% identity). 

The invention thus involves in one aspect Smad7 polypeptides, genes encoding those 
polypeptides, functional modifications and variants of the foregoing, useful fragments of the 
5 foregoing, as well as therapeutics relating thereto. The expression of these genes affects and is 
affected by TGF-(5 superfamily expression and signal transduction. The TGF-P superfamily 
members are well known to those of ordinary skill in the art and include TGF-[Js, activins, bone 
morphogenetic proteins (BMPs), Vgl, Mullerian inhibitory substance (MIS) and 
growth/differentiation factors (GDFs). 

10 Homologs and alleles of the Smad7 nucleic acids of the invention can be identified by 

conventional techniques. Thus, an aspect of the invention is those nucleic acid sequences which 
code for Smad7 polypeptides and which hybridize to a nucleic acid molecule consisting of the 
coding region of SEQ ID NO:3 or SEQ ID NO:5, under stringent conditions. The term 
"stringent conditions" as used herein refers to parameters with which the art is familiar. Nucleic 

15 acid hybridization parameters may be found in references which compile such methods, e.g. 
Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, or Current Protocols in 
Molecular Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. More 
specifically, stringent conditions, as used herein, refers, for example, to hybridization at 65°C in 

20 hybridization buffer (3.5 x SSC, 0.02% Ficoll, 0.02% polyvinyl pyrrolidone, 0.02% Bovine 
Serum Albumin, 2.5mM NaH 2 P0 4 (pH7), 0.5% SDS, 2mM EDTA). SSC is 0.1 5M sodium 
chloride/0.1 5 M sodium citrate, pH7; SDS is sodium dodecyl sulphate; and EDTA is 
ethylenediaminetetracetic acid. After hybridization, the membrane upon which the DNA is 
transferred is washed at 2 x SSC at room temperature and then at 0.1 x SSC/0. 1 x SDS at 

25 temperatures up to 65°C. 

There are other conditions, reagents, and so forth which can be used, which result in a 
similar degree of stringency. The skilled artisan will be familiar with such conditions, and thus 
they are not given here. It will be understood, however, that the skilled artisan will be able to 
manipulate the conditions in a manner to permit the clear identification of homologs and alleles 

30 of Smad7 nucleic acids of the invention. The skilled artisan also is familiar with the 

methodology for screening cells and libraries for expression of such molecules which then are 
routinely isolated, followed by isolation of the pertinent nucleic acid molecule and sequencing. 
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In general homologs and alleles typically will share at least 40% nucleotide identity 
and/or at least 50% amino acid identity to SEQ ID NOs:3 or 5 and SEQ ID NOs:4 or 6, 
respectively, in some instances will share at least 50% nucleotide identity and/or at least 65% 
amino acid identity and in still other instances will share at least 60% nucleotide identity and/or 
5 at least 75% amino acid identity. Watson-Crick complements of the foregoing nucleic acids also 
are embraced by the invention. 

In screening for Smad7 proteins, a Southern blot may be performed using the foregoing 
conditions, together with a radioactive probe. After washing the membrane to which the DNA is 
finally transferred, the membrane can be placed against X-ray film to detect the radioactive 
10 signal. 

The invention also includes degenerate nucleic acids which include alternative codons to 
those present in the native materials. For example, serine residues are encoded by the codons 
TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the purposes of 
encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the art that any of 

15 the serine-encoding nucleotide triplets may be employed to direct the protein synthesis apparatus, 
in vitro or in vivo, to incorporate a serine residue into an elongating Smad7 polypeptide. 
Similarly, nucleotide sequence triplets which encode other amino acid residues include, but are 
not limited to: CCA, CCC, CCG and CCT (proline codons); CGA, CGC, CGG, CGT, AGA and 
AGG (arginine codons); ACA, ACC, ACG and ACT (threonine codons); AAC and AAT 

20 (asparagine codons); and ATA, ATC and ATT (isoleucine codons). Other amino acid residues 
may be encoded similarly by multiple nucleotide sequences. Thus, the invention embraces 
degenerate nucleic acids that differ from the biologically isolated nucleic acids in codon 
sequence due to the degeneracy of the genetic code. 

The invention also provides isolated unique fragments of SEQ ID NOs:3 or 5 or 

25 complements of SEQ ID NOs:3 or 5. A unique fragment is one that is a 'signature' for the larger 
nucleic acid. It, for example, is long enough to assure that its precise sequence is not found in 
molecules outside of the Smad7 nucleic acids defined above. A unique fragment excludes, by 
definition, sequences consisting solely of EST sequences such as those described by SEQ ID 
NOs:l and 2. Unique fragments can be used as probes in Southern blot assays to identify such 

30 nucleic acids, or can be used in amplification assays such as those employing PCR. As known to 
those skilled in the art, large probes such as 200 nucleotides or more are preferred for certain 
uses such as Southern blots, while smaller fragments will be preferred for uses such as PCR. 
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Unique fragments also can be used to produce fusion proteins for generating antibodies or 
determining binding of the polypeptide fragments, as demonstrated in the Examples, or for 
generating immunoassay components. Likewise, unique fragments can be employed to produce 
nonfused fragments of the Smad7 polypeptides such as the N-terminal and C-terminal fragments 
5 disclosed herein, useful, for example, in the preparation of antibodies, in immunoassays, and as a 
competitive binding partner of the TGF-P, activin, BMP receptors and/or other polypeptides 
which bind to the Smad7 polypeptides, for example, in therapeutic applications. Unique 
fragments further can be used as antisense molecules to inhibit the expression of Smad7 nucleic 
acids and polypeptides, particularly for therapeutic purposes as described in greater detail below. 

10 As will be recognized by those skilled in the art, the size of the unique fragment will 

depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO: 3 and/or 
SEQ ID NO: 5 and its complement will require longer segments to be unique while others will 
require only short segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 bases long). Virtually any segment of 

15 SEQ ID NO: 7 or SEQ ID NO:8, or complements thereof, that is 1 8 or more nucleotides in length 
will be unique. Those skilled in the art are well versed in methods for selecting such sequences, 
typically on the basis of the ability of the unique fragment to selectively distinguish the sequence 
of interest from non-Smad7 nucleic acids. A comparison of the sequence of the fragment to 
those on known data bases typically is all that is necessary, although in vitro confirmatory 

20 hybridization and sequencing analysis may be performed. 

A unique fragment can be a functional fragment. A functional fragment of a nucleic acid 
molecule of the invention is a fragment which retains some functional property of the larger 
nucleic acid molecule, such as coding for a functional polypeptide, binding to proteins, 
regulating transcription of operably linked nucleic acids, and the like. One of ordinary skill in 

25 the art can readily determine using the assays described herein and those well known in the art to 
determine whether a fragment is a functional fragment of a nucleic acid molecule using no more 
than routine experimentation. 

As mentioned above, the invention embraces antisense oligonucleotides that selectively 
bind to a nucleic acid molecule encoding a Smad7 polypeptide, to increase TGF-P, activin and/or 

30 BMP signaling by reducing the amount of Smad7. This is desirable in virtually any medical 
condition wherein a reduction of Smad7 is desirable, e.g., to increase TGF-p signaling. 
As used herein, the term "antisense oligonucleotide" or "antisense" describes an 
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oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under physiological 
conditions to DNA comprising a particular gene or to an mRNA transcript of that gene and, 
thereby, inhibits the transcription of that gene and/or the translation of that mRNA. The 
5 antisense molecules are designed so as to interfere with transcription or translation of a target 
gene upon hybridization with the target gene or transcript. Those skilled in the art will recognize 
that the exact length of the antisense oligonucleotide and its degree of complementarity with its 
target will depend upon the specific target selected, including the sequence of the target and the 
particular bases which comprise that sequence. It is preferred that the antisense oligonucleotide 

10 be constructed and arranged so as to bind selectively with the target under physiological 

conditions, i.e., to hybridize substantially more to the target sequence than to any other sequence 
in the target cell under physiological conditions. Based upon SEQ ID NOs:3, 5 or 9, or upon 
allelic or homologous genomic and/or cDNA sequences, one of skill in the art can easily choose 
and synthesize any of a number of appropriate antisense molecules for use in accordance with the 

15 present invention. In order to be sufficiently selective and potent for inhibition, such antisense 
oligonucleotides should comprise at least 10 and, more preferably, at least 15 consecutive bases 
which are complementary to the target, although in certain cases modified oligonucleotides as 
short as 7 bases in length have been used successfully as antisense oligonucleotides (Wagner et 
al., Nature BiotechnoL 14:840-844, 1996). Most preferably, the antisense oligonucleotides 

20 comprise a complementary sequence of 20-30 bases. Although oligonucleotides may be chosen 
which are antisense to any region of the gene or mRNA transcripts, in preferred embodiments the 
antisense oligonucleotides correspond to N-terminal or 5' upstream sites such as translation 
initiation, transcription initiation or promoter sites. In addition, 3-untranslated regions may be 
targeted. Targeting to mRNA splicing sites has also been used in the art but may be less 

25 preferred if alternative mRNA splicing occurs. In addition, the antisense is targeted, preferably, 
to sites in which mRNA secondary structure is not expected (see, e.g., Sainio et al., Cell Mol 
Neurobiol 14(5):439-457, 1 994) and at which proteins are not expected to bind. Finally, 
although SEQ ID Nos:3, 5 and 9 disclose cDNA sequences, one of ordinary skill in the art may 
easily derive the genomic DNA corresponding to the cDNA of SEQ ID Nos:3, 5 or 9. Thus, the 

30 present invention also provides for antisense oligonucleotides which are complementary to the 
genomic DNA corresponding to SEQ ID Nos:3, 5 and 9. Similarly, antisense to allelic or 
homologous cDNAs and genomic DNAs are enabled without undue experimentation. 
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In one set of embodiments, the antisense oligonucleotides of the invention may be 
composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. That 
is, the 5' end of one native nucleotide and the 3' end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester internucleoside linkage. These 
5 oligonucleotides may be prepared by art recognized methods which may be carried out manually 
or by an automated synthesizer. They also may be produced recombinantly by vectors. 

In preferred embodiments, however, the antisense oligonucleotides of the invention also 
may include "modified" oligonucleotides. That is, the oligonucleotides may be modified in a 
number of ways which do not prevent them from hybridizing to their target but which enhance 

10 their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

The term "modified oligonucleotide" as used herein describes an oligonucleotide in 
which (1) at least two of its nucleotides are covalently linked via a synthetic internucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one nucleotide 
and the 3' end of another nucleotide) and/or (2) a chemical group not normally associated with 

15 nucleic acids has been covalently attached to the oligonucleotide. Preferred synthetic 
internucleoside linkages are phosphorothioates, alkylphosphonates, phosphorodithioates, 
phosphate esters, alkylphosphonothioates, phosphoramidates, carbamates, carbonates, phosphate 
triesters, acetamidates, carboxymethyl esters and peptides. 

The term "modified oligonucleotide" also encompasses oligonucleotides with a 

20 covalently modified base and/or sugar. For example, modified oligonucleotides include 

oligonucleotides having backbone sugars which are covalently attached to low molecular weight 
organic groups other than a hydroxyl group at the 3* position and other than a phosphate group at 
the 5' position. Thus modified oligonucleotides may include a 2'-0-alkylated ribose group. In 
addition, modified oligonucleotides may include sugars such as arabinose instead of ribose. The 

25 present invention, thus, contemplates pharmaceutical preparations containing modified antisense 
molecules that are complementary to and hybridizable with, under physiological conditions, 
nucleic acids encoding Smad7 polypeptides, together with pharmaceutically acceptable carriers. 

Antisense oligonucleotides may be administered as part of a pharmaceutical composition. 
Such a pharmaceutical composition may include the antisense oligonucleotides in combination 

30 with any standard physiologically and/or pharmaceutically acceptable carriers which are known 
in the art. The compositions should be sterile and contain a therapeutically effective amount of 
the antisense oligonucleotides in a unit of weight or volume suitable for administration to a 
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patient. The term "pharmaceutically acceptable" means a non-toxic material that does not 
interfere with the effectiveness of the biological activity of the active ingredients. The term 
"physiologically acceptable" refers to a non-toxic material that is compatible with a biological 
system such as a cell, cell culture, tissue, or organism. The characteristics of the carrier will 
5 depend on the route of administration. Physiologically and pharmaceutically acceptable carriers 
include diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials which are well 
known in the art. 

As used herein, a "vector" may be any of a number of nucleic acids into which a desired 
sequence may be inserted by restriction and ligation for transport between different genetic 

10 environments or for expression in a host cell. Vectors are typically composed of DNA although 
RNA vectors are also available. Vectors include, but are not limited to, plasmids, phagemids and 
virus genomes. A cloning vector is one which is able to replicate in a host cell, and which is 
further characterized by one or more endonuclease restriction sites at which the vector may be 
cut in a determinable fashion and into which a desired DNA sequence may be ligated such that 

15 the new recombinant vector retains its ability to replicate in the host cell. In the case of 

plasmids, replication of the desired sequence may occur many times as the plasmid increases in 
copy number within the host bacterium or just a single time per host before the host reproduces 
by mitosis. In the case of phage, replication may occur actively during a lytic phase or passively 
during a lysogenic phase. An expression vector is one into which a desired DNA sequence may 

20 be inserted by restriction and ligation such that it is operably joined to regulatory sequences and 
may be expressed as an RNA transcript. Vectors may further contain one or more marker 
sequences suitable for use in the identification of cells which have or have not been transformed 
or transfected with the vector. Markers include, for example, genes encoding proteins which 
increase or decrease either resistance or sensitivity to antibiotics or other compounds, genes 

25 which encode enzymes whose activities are detectable by standard assays known in the art (e.g., 
B-galactosidase or alkaline phosphatase), and genes which visibly affect the phenotype of 
transformed or transfected cells, hosts, colonies or plaques (e.g., green fluorescent protein). 
Preferred vectors are those capable of autonomous replication and expression of the structural 
gene products present in the DNA segments to which they are operably joined. 

30 As used herein, a coding sequence and regulatory sequences are said to be "operably" 

joined when they are covalently linked in such a way as to place the expression or transcription 
of the coding sequence under the influence or control of the regulatory sequences. If it is desired 
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that the coding sequences be translated into a functional protein, two DNA sequences are said to 
be operably joined if induction of a promoter in the 5' regulatory sequences results in the 
transcription of the coding sequence and if the nature of the linkage between the two DNA 
sequences does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the 
5 ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere 
with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a 
promoter region would be operably joined to a coding sequence if the promoter region were 
capable of effecting transcription of that DNA sequence such that the resulting transcript might 
be translated into the desired protein or polypeptide. 

10 The precise nature of the regulatory sequences needed for gene expression may vary 

between species or cell types, but shall in general include, as necessary, 5' non-transcribed and 5' 
non-translated sequences involved with the initiation of transcription and translation respectively, 
such as a TATA box, capping sequence, CAAT sequence, and the like. Especially, such 5 5 non- 
transcribed regulatory sequences will include a promoter region which includes a promoter 

15 sequence for transcriptional control of the operably joined gene. Regulatory sequences may also 
include enhancer sequences or upstream activator sequences as desired. The vectors of the 
invention may optionally include 5' leader or signal sequences. The choice and design of an 
appropriate vector is within the ability and discretion of one of ordinary skill in the art. 

Expression vectors containing all the necessary elements for expression are commercially 

20 available and known to those skilled in the art. See, e.g., Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, 1989. Cells are 
genetically engineered by the introduction into the cells of heterologous DNA (RNA) encoding 
Smad7 polypeptide or fragment or variant thereof. That heterologous DNA (RNA) is placed 
under operable control of transcriptional elements to permit the expression of the heterologous 

25 DNA in the host cell. 

Preferred systems for mRNA expression in mammalian cells are those such as pRc/CMV 
(available from Invitrogen, Carlsbad, CA) that contain a selectable marker such as a gene that 
confers G418 resistance (which facilitates the selection of stably transfected cell lines) and the 
human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, suitable for 

30 expression in primate or canine cell lines is the pCEP4 vector (Invitrogen), which contains an 
Epstein Barr virus (EBV) origin of replication, facilitating the maintenance of plasmid as a 
multicopy extrachromosomal element. Another expression vector is the pEF-BOS plasmid 
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containing the promoter of polypeptide Elongation Factor 1 a, which stimulates efficiently 
transcription in vitro. The plasmid is described by Mishizuma and Nagata (Nuc. Acids Res. 
18:5322, 1990), and its use in transfection experiments is disclosed by, for example, Demoulin 
(Mol. Cell. Biol 16:4710-4716, 1996). Still another preferred expression vector is an 
5 adenovirus, described by Stratford-Perricaudet, which is defective for El and E3 proteins (J. 
Clin. Invest. 90:626-630, 1992). The use of the adenovirus as an Adeno.Pl A recombinant is 
disclosed by Warnier et al., in intradermal injection in mice for immunization against PI A (Int. J. 
Cancer, 67:303-310, 1996). 

The invention also embraces so-called expression kits, which allow the artisan to prepare 

10 a desired expression vector or vectors. Such expression kits include at least separate portions of 
each of the previously discussed coding sequences. Other components may be added, as desired, 
as long as the previously mentioned sequences, which are required, are included. 

The invention also permits the construction of Smad7 gene "knock-outs" in cells and in 
animals, providing materials for studying certain aspects of TGF-0, activin and/or BMP signal 

15 transduction. 

The invention also provides isolated polypeptides, which include the polypeptides of 
SEQ ID NOs:4 and 6 and unique fragments of SEQ ID NOs:4 and 6 including fragments 
comprising amino acids 2-261 of SEQ ID NOs:4 and 6 and amino acids 204-426 of SEQ ID 
NOs:4 and 6. Such polypeptides are useful, for example, alone or as fusion proteins to generate 

20 antibodies, as a components of an immunoassay. 

A unique fragment of an Smad7 polypeptide, in general, has the features and 
characteristics of unique fragments as discussed above in connection with nucleic acids. As will 
be recognized by those skilled in the art, the size of the unique fragment will depend upon factors 
such as whether the fragment constitutes a portion of a conserved protein domain. Thus, some 

25 regions of amino acids 2-26 1 of SEQ ID NOs:4 and 6 and amino acids 204-426 of SEQ ID 
NOs:4 and 6 will require longer segments to be unique while others will require only short 
segments, typically between 5 and 12 amino acids (e.g. 5, 6, 7, 8, 9, 10, 11 and 12 amino acids 
long). Virtually any segment of amino acids 2-261 of SEQ ID NOs:4 and 6 and amino acids 
204-426 of SEQ ID NOs:4 and 6, that is 10 or more amino acids in length will be unique. 

30 Unique fragments of a polypeptide preferably are those fragments which retain a distinct 

functional capability of the polypeptide. Functional capabilities which can be retained in a 
unique fragment of a polypeptide include interaction with antibodies, interaction with other 
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polypeptides (such as TpR-I) or fragments thereof, selective binding of nucleic acids or proteins, 
and enzymatic activity. For example, as exemplified herein, N-terminal and C-terminal Smad7 
fragments such as those which includes amino acids 2-261or 204-426 of SEQ ID NO:4 or SEQ 
ID NO: 6 can be used as a functional equivalent of full length Smad7 in the methods of the 
5 invention, including e.g., inhibition of TGF-[5 signal transduction. Other Smad polypeptide 
fragments, e.g., other N-terminal or C-terminal fragments, can be selected according to their 
functional properties. For example, one of ordinary skill in the art can prepare Smad7 fragments 
recombinantly and test those fragments according to the methods expemplified below, such as 
binding to a TGFp superfamily receptor, or inhibition of pathway specific Smad polypeptide 

10 phosphorylation. Those skilled in the art also are well versed in methods for selecting unique 
amino acid sequences, typically on the basis of the ability of the unique fragment to selectively 
distinguish the sequence of interest from non-family members. A comparison of the sequence of 
the fragment to those on known data bases typically is all that is necessary. 

The invention embraces variants of the Smad7 polypeptides described above. As used 

15 herein, a "variant" of a Smad7 polypeptide is a polypeptide which contains one or more 

modifications to the primary amino acid sequence of a Smad7 polypeptide. Modifications which 
create a Smad7 variant can be made to a Smad7 polypeptide 1) to reduce or eliminate an activity 
of a Smad7 polypeptide, such as binding to TpR-I; 2) to enhance a property of a Smad7 
polypeptide, such as protein stability in an expression system or the stability of protein-protein 

20 binding; or 3) to provide a novel activity or property to a Smad7 polypeptide, such as addition of 
an antigenic epitope or addition of a detectable moiety. Modifications to a Smad7 polypeptide 
are typically made to the nucleic acid which encodes the Smad7 polypeptide, and can include 
deletions, point mutations, truncations, amino acid substitutions and additions of amino acids or 
non-amino acid moieties. Alternatively, modifications can be made directly to the polypeptide, 

25 such as by cleavage, addition of a linker molecule, addition of a detectable moiety, such as 
biotin, addition of a fatty acid, and the like. Modifications also embrace fusion proteins 
comprising all or part of the Smad7 amino acid sequence. 

In general, variants include Smad7 polypeptides which are modified specifically to alter a 
feature of the polypeptide unrelated to its physiological activity. For example, cysteine residues 

30 can be substituted or deleted to prevent unwanted disulfide linkages. Similarly, certain amino 
acids can be changed to enhance expression of a Smad7 polypeptide by eliminating proteolysis 
by proteases in an expression system (e.g., dibasic amino acid residues in yeast expression 
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systems in which KEX2 protease activity is present). 

Mutations of a nucleic acid which encode a Smad7 polypeptide preferably preserve the 
amino acid reading frame of the coding sequence, and preferably do not create regions in the 
nucleic acid which are likely to hybridize to form secondary structures, such a hairpins or loops, 
5 which can be deleterious to expression of the variant polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to determine which 
mutation provides a variant polypeptide with the desired properties. Further mutations can be 

10 made to variants (or to non- variant Smad7 polypeptides) which are silent as to the amino acid 
sequence of the polypeptide, but which provide preferred codons for translation in a particular 
host. The preferred codons for translation of a nucleic acid in, e.g., E. coli, are well known to 
those of ordinary skill in the art. Still other mutations can be made to the noncoding sequences 
of a Smad7 gene or cDNA clone to enhance expression of the polypeptide. The activity of 

15 variants of Smad7 polypeptides can be tested by cloning the gene encoding the variant Smad7 
polypeptide into a bacterial or mammalian expression vector, introducing the vector into an 
appropriate host cell, expressing the variant Smad7 polypeptide, and testing for a functional 
capability of the Smad7 polypeptides as disclosed herein. For example, the variant Smad7 
polypeptide can be tested for inhibition of T(iR-I, activin and/or BMP receptor signaling activity 

20 as disclosed in the Examples, or for inhibition of Smadl, Smad2, Smad3 and/or SmadS 

phosphorylation as is also disclosed herein. Preparation of other variant polypeptides may favor 
testing of other activities, as will be known to one of ordinary skill in the art. 

The skilled artisan will also realize that conservative amino acid substitutions may be 
made in Smad7 polypeptides to provide functionally equivalent variants of the foregoing 

25 polypeptides, i.e, the variants retain the functional capabilities of the Smad7 polypeptides. As 
used herein, a "conservative amino acid substitution" refers to an amino acid substitution which 
does not alter the relative charge or size characteristics of the protein in which the amino acid 
substitution is made. Variants can be prepared according to methods for altering polypeptide 
sequence known to one of ordinary skill in the art such as are found in references which compile 

30 such methods, e.g. Molecular Cloning: A Laboratory Manual, J. Sambrook, et aL, eds., Second 
Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, or Current 
Protocols in Molecular Biology, F.M. Ausubel, et aL, eds., John Wiley & Sons, Inc., New York. 
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Exemplary functionally equivalent variants of the Smad7 polypeptides include conservative 
amino acid substitutions of SEQ ID NOs:4 or 6. Conservative substitutions of amino acids 
include substitutions made amongst amino acids within the following groups: (a) M, 1, L, V; (b) 
F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q, N; and (g) E, D. 
5 Conservative amino-acid substitutions in the amino acid sequence of Smad7 polypeptides 

to produce functionally equivalent variants of Smad7 polypeptides typically are made by 
alteration of a nucleic acid encoding a Smad7 polypeptide (SEQ ID NOs:3 and 5). Such 
substitutions can be made by a variety of methods known to one of ordinary skill in the art. For 
example, amino acid substitutions may be made by PCR-directed mutation, site-directed 

10 mutagenesis according to the method of Kunkel (Kunkel, Proc. Nat. Acad. Sci. U.S.A. 82: 488- 
492, 1985), or by chemical synthesis of a gene encoding a Smad7 polypeptide. Where amino 
acid substitutions are made to a small unique fragment of a Smad7 polypeptide, such as a TpR-I 
binding site peptide, the substitutions can be made by directly synthesizing the peptide. The 
activity of functionally equivalent fragments of Smad7 polypeptides can be tested by cloning the 

15 gene encoding the altered Smad7 polypeptide into a bacterial or mammalian expression vector, 
introducing the vector into an appropriate host cell, expressing the altered Smad7 polypeptide, 
and testing for a functional capability of the Smad7 polypeptides as disclosed herein. Peptides 
which are chemically synthesized can be tested directly for function, e.g., for binding to TpR-I, 
ActR-IB and/or BMPR-IB. 

20 The invention as described herein has a number of uses, some of which are described 

elsewhere herein. First, the invention permits isolation of the Smad7 protein molecules (SEQ ID 
NOs:4 and 6). A variety of methodologies well-known to the skilled practitioner can be utilized 
to obtain isolated Smad7 molecules. The polypeptide may be purified from cells which naturally 
produce the polypeptide by chromatographic means or immunological recognition. 

25 Alternatively, an expression vector may be introduced into cells to cause production of the 
polypeptide. In another method, mRNA transcripts may be microinjected or otherwise 
introduced into cells to cause production of the encoded polypeptide. Translation of mRNA in 
cell-free extracts such as the reticulocyte lysate system also may be used to produce polypeptide. 
Those skilled in the art also can readily follow known methods for isolating Smad7 polypeptides. 

30 These include, but are not limited to, immunochromatography, HPLC, size-exclusion 
chromatography, ion-exchange chromatography and immune-affinity chromatography. 

The isolation of the Smad7 gene also makes it possible for the artisan to diagnose a 
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disorder characterized by expression of Smad7. These methods involve determining expression 
of the Smad7 gene, and/or Smad7 polypeptides derived therefrom. In the former situation, such 
determinations can be carried out via any standard nucleic acid determination assay, including 
the polymerase chain reaction as exemplified in the examples below, or assaying with labeled 
5 hybridization probes. 

The invention also makes it possible isolate proteins such as TpR-I, ActR-IB and 
BMPR-IB by the binding of such proteins to Smad7 as disclosed herein. The identification of 
this binding also permits one of skill in the art to block the binding of Smad7 to other proteins, 
such as TpR-I, as well as blocking the binding of other Smads, such as Smad2 or Smad3 to TpR- 

10 I receptors. Binding of the proteins can be effected by introducing into a biological system in 
which the proteins bind (e.g., a cell) a polypeptide including a Smad7 TpR-I binding site in an 
amount sufficient to block the binding. The identification of a TpR-I domain binding site in 
Smad7 also enables one of skill in the art to prepare modified proteins, using standard 
recombinant DNA techniques, which can bind to proteins such as TpR-I, ActR-IB and 

15 BMPR-IB. For example, when one desires to target a certain protein to a TGF-p receptor 

complex, one can prepare a fusion polypeptide of the protein and the Smad7 TpR-I binding site. 
Additional uses are described further herein. 

The invention further provides methods for reducing or increasing TGF-p family signal 
transduction in a cell. Such methods are useful in vitro for altering the TGF-P signal 

20 transduction, for example, in testing compounds for potential to block aberrant TGF-P signal 
transduction or increase deficient TGF-P signal transduction. In vivo, such methods are useful 
for modulating growth, e.g., to treat cancer and fibrosis. Increasing TGF-P signal transduction in 
a cell by, e.g., introducing a dominant negative Smad7 polypeptide or Smad7 antisense 
oligonucleotides in the cell, can be used to provide a model system for testing the effects of 

25 putative inhibitors of TGF-p signal transduction. Such methods also are useful in the treatment 
of conditions which result from excessive or deficient TGF-P signal transduction. TGF-P signal 
transduction can be measured by a variety of ways known to one of ordinary skill in the art, such 
as the reporter systems described in the Examples. Various modulators of Smad7 activity can be 
screened for effects on TGF-P signal transduction using the methods disclosed herein. The 

30 skilled artisan can first determine the modulation of a Smad7 activity, such as TGF-p signaling 
activity, and then apply such a modulator to a target cell or subject and assess the effect on the 
target cell or subject. For example, in screeing for modulators of Smad7 useful in the treatment 
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of cancer, cells in culture can be contacted with Smad7 modulators and the increase or decrease 
of growth or focus formation of the cells can be determined according to standard procedures. 
Smad7 activity modulators can be assessed for their effects on other TGF-p signal transduction 
downstream effects by similar methods in many cell types. The foregoing also applies to 
5 signalling via activin and BMP complexes. 

The invention also provides, in certain embodiments, "dominant negative" polypeptides 
derived from SEQ ID NOs:4 and/or 6. A dominant negative polypeptide is an inactive variant of 
a protein, which, by interacting with the cellular machinery, displaces an active protein from its 
interaction with the cellular machinery or competes with the active protein, thereby reducing the 

10 effect of the active protein. For example, a dominant negative receptor which binds a ligand but 
does not transmit a signal in response to binding of the ligand can reduce the biological effect of 
expression of the ligand. Likewise, a dominant negative catalytically-inactive kinase which 
interacts normally with target proteins but does not phosphorylate the target proteins can reduce 
phosphorylation of the target proteins in response to a cellular signal. Similarly, a dominant 

15 negative transcription factor which binds to a promoter site in the control region of a gene but 
does not increase gene transcription can reduce the effect of a normal transcription factor by 
occupying promoter binding sites without increasing transcription. 

The end result of the expression of a dominant negative polypeptide in a cell is a 
reduction in function of active proteins. One of ordinary skill in the art can assess the potential 

20 for a dominant negative variant of a protein, and using standard mutagenesis techniques to create 
one or more dominant negative variant polypeptides. For example, given the teachings contained 
herein of a Smad7 polypeptide, one of ordinary skill in the art can modify the sequence of the 
Smad7 polypeptide by site-specific mutagenesis, scanning mutagenesis, partial gene deletion or 
truncation, and the like. See, e.g., U.S. Patent No. 5,580,723 and Sambrook et al., Molecular 

25 Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, 1989. 
The skilled artisan then can test the population of mutagenized polypeptides for diminution in a 
selected activity (e.g., Smad7 reduction of TGF-(5 signalling activity) and/or for retention of such 
an activity. Other similar methods for creating and testing dominant negative variants of a 
protein will be apparent to one of ordinary skill in the art. 

30 Dominant negative Smad7 proteins include variants in which a portion of the TpR-I, 

activin receptor or BMP receptor binding site has been mutated or deleted to reduce or eliminate 
Smad7 interaction with the TGF-p, activin, or BMP receptor complex respectively. Other 
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examples include Smad7 variants in which the ability to inhibit phosphorylation of Smad2 and/or 
Smad3 is reduced. One of ordinary skill in the art can readily prepare and test Smad7 variants 
bearing mutations or deletions in the C-terminal domain (e.g., in the MH2 domain) or in the N- 
terminal domain (e.g., in the glycine/glutamic acid residue rich region). 
5 The invention also involves agents such as polypeptides which bind to Smad7 

polypeptides and to complexes of Smad7 polypeptides and binding partners such as T0R-I, 
ActR-IB and BMPR-IB. Such binding agents can be used, for example, in screening assays to 
detect the presence or absence of Smad7 polypeptides and complexes of Smad7 polypeptides and 
their binding partners and in purification protocols to isolate Smad7 polypeptides and complexes 

10 of Smad7 polypeptides and their binding partners. Such agents also can be used to inhibit the 
native activity of the Smad7 polypeptides or their binding partners, for example, by binding to 
such polypeptides, or their binding partners or both. 

The invention, therefore, embraces peptide binding agents which, for example, can be 
antibodies or fragments of antibodies having the ability to selectively bind to Smad7 

15 polypeptides. Antibodies include polyclonal and monoclonal antibodies, prepared according to 
conventional methodology. 

Significantly, as is well-known in the art, only a small portion of an antibody molecule, 
the paratope, is involved in the binding of the antibody to its epitope (see, in general, Clark, 
W.R. (1986) The Experimental Foundations of Modern Immunology Wiley & Sons, Inc., New 

20 York; Roitt, I. (1991) Essential Immunology , 7th Ed., Blackwell Scientific Publications, 

Oxford). The pFc' and Fc regions, for example, are effectors of the complement cascade but are 
not involved in antigen binding. An antibody from which the pFc' region has been enzymatically 
cleaved, or which has been produced without the pFc' region, designated an F(ab') 2 fragment, 
retains both of the antigen binding sites of an intact antibody. Similarly, an antibody from which 

25 the Fc region has been enzymatically cleaved, or which has been produced without the Fc region, 
designated an Fab fragment, retains one of the antigen binding sites of an intact antibody 
molecule. Proceeding further, Fab fragments consist of a covalently bound antibody light chain 
and a portion of the antibody heavy chain denoted Fd. The Fd fragments are the major 
determinant of antibody specificity (a single Fd fragment may be associated with up to ten 

30 different light chains without altering antibody specificity) and Fd fragments retain epitope- 
binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there are 
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complementarity determining regions (CDRs), which directly interact with the epitope of the 
antigen, and framework regions (FRs), which maintain the tertiary structure of the paratope (see, 
in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and the light chain of 
IgG immunoglobulins, there are four framework regions (FR1 through FR4) separated 
5 respectively by three complementarity determining regions (CDR1 through CDR3). The CDRs, 
and in particular the CDR3 regions, and more particularly the heavy chain CDR3, are largely 
responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian antibody 
may be replaced with similar regions of conspecific or heterospecific antibodies while retaining 

10 the epitopic specificity of the original antibody. This is most clearly manifested in the 

development and use of "humanized" antibodies in which non-human CDRs are covalently 
joined to human FR and/or Fc/pFc f regions to produce a functional antibody. Thus, for example, 
PCT International Publication Number WO 92/04381 teaches the production and use of 
humanized murine RS V antibodies in which at least a portion of the murine FR regions have 

15 been replaced by FR regions of human origin. Such antibodies, including fragments of intact 
antibodies with antigen-binding ability, are often referred to as "chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab') 2 , Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or FR 
and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous 

20 human or non-human sequences; chimeric F(ab') 2 fragment antibodies in which the FR and/or 
CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous human 
or non-human sequences; chimeric Fab fragment antibodies in which the FR and/or CDR1 and/or 
CDR2 and/or light chain CDR3 regions have been replaced by homologous human or non- 
human sequences; and chimeric Fd fragment antibodies in which the FR and/or CDR1 and/or 

25 CDR2 regions have been replaced by homologous human or non-human sequences. The present 
invention also includes so-called single chain antibodies. 

Thus, the invention involves polypeptides of numerous size and type that bind 
specifically to Smad7 polypeptides, and complexes of both Smad7 polypeptides and their 
binding partners. These polypeptides may be derived also from sources other than antibody 

30 technology. For example, such polypeptide binding agents can be provided by degenerate 
peptide libraries which can be readily prepared in solution, in immobilized form or as phage 
display libraries. Combinatorial libraries also can be synthesized of peptides containing one or 
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more amino acids. Libraries further can be synthesized of peptoids and non-peptide synthetic 
moieties. 

Phage display can be particularly effective in identifying binding peptides useful 
according to the invention. Briefly, one prepares a phage library (using e.g. ml 3, fd, or lambda 
5 phage), displaying inserts from 4 to about 80 amino acid residues using conventional procedures. 
The inserts may represent, for example, a completely degenerate or biased array. One then can 
select phage-bearing inserts which bind to the Smad7 polypeptide. This process can be repeated 
through several cycles of reselection of phage that bind to the Smad7 polypeptide. Repeated 
rounds lead to enrichment of phage bearing particular sequences. DNA sequence analysis can be 

10 conducted to identify the sequences of the expressed polypeptides. The minimal linear portion of 
the sequence that binds to the Smad7 polypeptide can be determined. One can repeat the 
procedure using a biased library containing inserts containing part or all of the minimal linear 
portion plus one or more additional degenerate residues upstream or downstream thereof. Yeast 
two-hybrid screening methods also may be used to identify polypeptides that bind to the Smad7 

15 polypeptides. Thus, the Smad7 polypeptides of the invention, or a fragment thereof, can be used 
to screen peptide libraries, including phage display libraries, to identify and select peptide 
binding partners of the Smad7 polypeptides of the invention. Such molecules can be used, as 
described, for screening assays, for purification protocols, for interfering directly with the 
functioning of Smad7 and for other purposes that will be apparent to those of ordinary skill in the 

20 art. 

A Smad7 polypeptide, or a fragment thereof, also can be used to isolate their native 
binding partners, including, e.g., the TGF-p, activin, or BMP receptor complexes. Isolation of 
such binding partners may be performed according to well-known methods. For example, 
isolated Smad7 polypeptides can be attached to a substrate (e.g., chromatographic media, such as 

25 polystyrene beads, or a filter), and then a solution suspected of containing the TGF-P receptor 
complex may be applied to the substrate. If a TGF-P receptor complex which can interact with 
Smad7 polypeptides is present in the solution, then it will bind to the substrate-bound Smad7 
polypeptide. The TGF-p receptor complex then may be isolated. Other proteins which are 
binding partners for Smad7, such as other Smads, activin receptor complexes, and BMP receptor 

30 complexes may be isolated by similar methods without undue experimentation. 

It will also be recognized that the invention embraces the use of the Smad7 cDNA 
sequences in expression vectors, as well as to transfect host cells and cell lines, be these 
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prokaryotic (e.g., E. coli), or eukaryotic (e.g., CHO cells, COS cells, yeast expression systems 
and recombinant baculo virus expression in insect cells). Especially useful are mammalian cells 
such as human, mouse, hamster, pig, goat, primate, etc. They may be of a wide variety of tissue 
types, and include primary cells and cell lines. Specific examples include keratinocytes, 
5 peripheral blood leukocytes, bone marrow stem cells and embryonic stem cells. The expression 
vectors require that the pertinent sequence, i.e., those nucleic acids described supra, be operably 
linked to a promoter. 

The invention also includes transgenic non-human animals. As used herein, "transgenic 
non-human animals" includes non-human animals having one or more exogenous nucleic acid 

10 molecules incorporated in germ line cells and/or somatic cells. Thus the transgenic animal 
include "knockout" animals having a homozygous or heterozygous gene disruption by 
homologous recombination, animals having episomal or chromosomally incorporated expression 
vectors, etc. Knockout animals can be prepared by homologous recombination using embryonic 
stem cells as is well known in the art. The recombination can be facilitated by the cre/lox system 

15 or other recombinase systems known to one of ordinary skill in the art. In certain embodiments, 
the recombinase system itself is expressed conditionally, for example, in certain tissues or cell 
types, at certain embryonic or post-embryonic developmental stages, inducibly by the addition of 
a compound which increases or decreases expression, and the like. In general, the conditional 
expression vectors used in such systems use a variety of promoters which confer the desired gene 

20 expression pattern (e.g., temporal or spatial). Conditional promoters also can be operably linked 
to Smad7 nucleic acid molecules to increase expression of Smad7 in a regulated or conditional 
manner. 7hms*-acting negative regulators of Smad7 activity or expression also can be operably 
linked to a conditional promoter as described above. Such trans-acting regulators include 
antisense Smad7 nucleic acids molecules, nucleic acid molecules which encode dominant 

25 negative Smad7 molecules, ribozyme molecules specific for Smad7 nucleic acids, and the like. 
The transgenic non-human animals are useful in experiments directed toward testing biochemical 
or physiological effects of diagnostics or therapeutics for conditions characterized by increased 
or decreased Smad7 expression. Other uses will be apparent to one of ordinary skill in the art. 
The invention also contemplates gene therapy. The procedure for performing ex vivo 

30 gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history of 
that patent, all of which are publicly available documents. In general, it involves introduction in 
vitro of a functional copy of a gene into a cell(s) of a subject which contains a defective copy of 
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the gene, and returning the genetically engineered cell(s) to the subject. The functional copy of 
the gene is under operable control of regulatory elements which permit expression of the gene in 
the genetically engineered cell(s). Numerous transfection and transduction techniques as well as 
appropriate expression vectors are well known to those of ordinary skill in the art, some of which 
5 are described in PCT application WO95/00654. In vivo gene therapy using vectors such as 
adenovirus, retroviruses, herpes virus, find targeted liposomes also is contemplated according to 
the invention. 

The invention further provides efficient methods of identifying pharmacological agents or 
lead compounds for agents active at the level of a Smad7 or Smad7 fragment modulatable 

10 cellular function. In particular, such functions include TGF-p, activin and/or BMP signal 

transduction and formation of a TGF-P, activin and/or BMP receptor-Smad7 protein complex. 
Generally, the screening methods involve assaying for compounds which interfere with a Smad7 
activity such as TGF-p receptor- Smad7 binding, etc, although compounds which enhance Smad7 
activity also can be assayed using the screening methods. Such methods are adaptable to 

15 automated, high throughput screening of compounds. The target therapeutic indications for 
pharmacological agents detected by the screening methods are limited only in that the target 
cellular function be subject to modulation by alteration of the formation of a complex comprising 
a Smad7 polypeptide or fragment thereof and one or more natural Smad7 intracellular binding 
targets, such as TGF-P receptor. Target indications include cellular processes modulated by 

20 TGF-p, activin and/or BMP signal transduction following receptor-ligand binding. 

A wide variety of assays for pharmacological agents are provided, including, labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays, cell- 
based assays such as two- or three-hybrid screens, expression assays, etc. For example, three- 
hybrid screens are used to rapidly examine the effect of transfected nucleic acids on the 

25 intracellular binding of Smad7 or Smad7 fragments to specific intracellular targets. The 

transfected nucleic acids can encode, for example, combinatorial peptide libraries or antisense 
molecules. Convenient reagents for such assays, e.g., GAL4 fusion proteins, are known in the 
art. An exemplary cell-based assay involves transfecting a cell with a nucleic acid encoding a 
Smad7 polypeptide fused to a GAL4 DNA binding domain and a nucleic acid encoding a TGF-P 

30 receptor domain which interacts with Smad7 fused to a transcription activation domain such as 
VP 16. The cell also contains a reporter gene operably linked to a gene expression regulatory 
region, such as one or more GAL4 binding sites. Activation of reporter gene transcription occurs 
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when the Smad7 and TGF-p receptor fusion polypeptides bind such that the GAL4 DNA binding 
domain and the VP 1 6 transcriptional activation domain are brought into proximity to enable 
transcription of the reporter gene. Agents which modulate a Smad7 polypeptide mediated cell 
function are then detected through a change in the expression of reporter gene. Methods for 
5 determining changes in the expression of a reporter gene are known in the art. 

Smad7 fragments used in the methods, when not produced by a transfected nucleic acid 
are added to an assay mixture as an isolated polypeptide. Smad7 polypeptides preferably are 
produced recombinantly, although such polypeptides may be isolated from biological extracts. 
Recombinantly produced Smad7 polypeptides include chimeric proteins comprising a fusion of a 

10 Smad7 protein with another polypeptide, e.g., a polypeptide capable of providing or enhancing 
protein-protein binding, sequence specific nucleic acid binding (such as GAL4), enhancing 
stability of the Smad7 polypeptide under assay conditions, or providing a detectable moiety, such 
as green fluorescent protein or Flag epitope as provided in the examples below. 

The assay mixture is comprised of a natural intracellular Smad7 binding target such as a 

15 TGF-P receptor or fragment thereof capable of interacting with Smad7. While natural Smad7 
binding targets may be used, it is frequently preferred to use portions (e.g., peptides or nucleic 
acid fragments) or analogs (i.e., agents which mimic the Smad7 binding properties of the natural 
binding target for purposes of the assay) of the Smad7 binding target so long as the portion or 
analog provides binding affinity and avidity to the Smad7 fragment measurable in the assay. 

20 The assay mixture also comprises a candidate pharmacological agent. Typically, a 

plurality of assay mixtures are run in parallel with different agent concentrations to obtain a 
different response to the various concentrations. Typically, one of these concentrations serves as 
a negative control, i.e., at zero concentration of agent or at a concentration of agent below the 
limits of assay detection. Candidate agents encompass numerous chemical classes, although 

25 typically they are organic compounds. Preferably, the candidate pharmacological agents are 
small organic compounds, i.e., those having a molecular weight of more than 50 yet less than 
about 2500, preferably less than about 1000 and, more preferably, less than about 500. 
Candidate agents comprise functional chemical groups necessary for structural interactions with 
polypeptides and/or nucleic acids, and typically include at least an amine, carbonyl, hydroxyl or 

30 carboxyl group, preferably at least two of the functional chemical groups and more preferably at 
least three of the functional chemical groups. The candidate agents can comprise cyclic carbon 
or heterocyclic structure and/or aromatic or polyaromatic structures substituted with one or more 
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of the above-identified functional groups. Candidate agents also can be biomolecules such as 
peptides, saccharides, fatty acids, sterols, isoprenoids, purines, pyrimidines, derivatives or 
structural analogs of the above, or combinations thereof and the like. Where the agent is a 
nucleic acid, the agent typically is a DNA or RNA molecule, although modified nucleic acids as 
5 defined herein are also contemplated. 

Candidate agents are obtained from a wide variety of sources including libraries of 
synthetic or natural compounds. For example, numerous means are available for random and 
directed synthesis of a wide variety of organic compounds and biomolecules, including 
expression of randomized oligonucleotides, synthetic organic combinatorial libraries, phage 

10 display libraries of random peptides, and the like. Alternatively, libraries of natural compounds 
in the form of bacterial, fungal, plant and animal extracts are available or readily produced. 
Additionally, natural and synthetically produced libraries and compounds can be readily be 
modified through conventional chemical, physical, and biochemical means. Further, known 
pharmacological agents may be subjected to directed or random chemical modifications such as 

15 acylation, alkylation, esterification, amidification, etc. to produce structural analogs of the 
agents. 

A variety of other reagents also can be included in the mixture. These include reagents 
such as salts, buffers, neutral proteins (e.g., albumin), detergents, etc. which may be used to 
facilitate optimal protein-protein and/or protein-nucleic acid binding. Such a reagent may also 

20 reduce non-specific or background interactions of the reaction components. Other reagents that 
improve the efficiency of the assay such as protease, inhibitors, nuclease inhibitors, antimicrobial 
agents, and the like may also be used. 

The mixture of the foregoing assay materials is incubated under conditions whereby, but 
for the presence of the candidate pharmacological agent, the Smad7 polypeptide specifically 

25 binds the cellular binding target, a portion thereof or analog thereof. The order of addition of 
components, incubation temperature, time of incubation, and other perimeters of the assay may 
be readily determined. Such experimentation merely involves optimization of the assay 
parameters, not the fundamental composition of the assay. Incubation temperatures typically are 
between 4°C and 40 °C. Incubation times preferably are minimized to facilitate rapid, high 

30 throughput screening, and typically are between 0.1 and 10 hours. 

After incubation, the presence or absence of specific binding between the Smad7 
polypeptide and one or more binding targets is detected by any convenient method available to 
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the user. For cell free binding type assays, a separation step is often used to separate bound from 
unbound components. The separation step may be accomplished in a variety of ways. 
Conveniently, at least one of the components is immobilized on a solid substrate, from which the 
unbound components may be easily separated. The solid substrate can be made of a wide variety 
5 of materials and in a wide variety of shapes, e.g., microtiter plate, microbead, dipstick, resin 
particle, etc. The substrate preferably is chosen to maximum signal to noise ratios, primarily to 
minimize background binding, as well as for ease of separation and cost. 

Separation may be effected for example, by removing a bead or dipstick from a reservoir, 
emptying or diluting a reservoir such as a microtiter plate well, rinsing a bead, particle, 

10 chromotograpic column or filter with a wash solution or solvent. The separation step preferably 
includes multiple rinses or washes. For example, when the solid substrate is a microtiter plate, 
the wells may be washed several times with a washing solution, which typically includes those 
components of the incubation mixture that do not participate in specific bindings such as salts, 
buffer, detergent, non-specific protein, etc. Where the solid substrate is a magnetic bead, the 

15 beads may be washed one or more times with a washing solution and isolated using a magnet. 

Detection may be effected in any convenient way for cell-based assays such as two- or 
three-hybrid screens. The transcript resulting from a reporter gene transcription assay of Smad7 
polypeptide interacting with a target molecule typically encodes a directly or indirectly 
detectable product, e.g., P-galactosidase activity, luciferase activity, and the like. For cell free 

20 binding assays, one of the components usually comprises, or is coupled to, a detectable label. A 
wide variety of labels can be used, such as those that provide direct detection (e.g., radioactivity, 
luminescence, optical or electron density, etc), or indirect detection (e.g., epitope tag such as the 
FLAG epitope, enzyme tag such as horseseradish peroxidase, etc.). The label may be bound to a 
Smad7 binding partner, or incorporated into the structure of the binding partner. 

25 A variety of methods may be used to detect the label, depending on the nature of the label 

and other assay components. For example, the label may be detected while bound to the solid 
substrate or subsequent to separation from the solid substrate. Labels may be directly detected 
through optical or electron density, radioactive emissions, nonradiative energy transfers, etc. or 
indirectly detected with antibody conjugates, strepavidin-biotin conjugates, etc. Methods for 

30 detecting the labels are well known in the art. 

The invention provides Smad7-specific binding agents, methods of identifying and 
making such agents, and their use in diagnosis, therapy and pharmaceutical development. For 
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example, Smad7-specific pharmacological agents are useful in a variety of diagnostic and 
therapeutic applications, especially where disease or disease prognosis is associated with 
improper utilization of a pathway involving Smad7, e.g., TGF-P induced phosphorylation of 
Smad2 or Smad3, TGF-0 receptor-Smad7 complex formation, activin or BMP signalling, etc. 
5 Novel Smad7-specific binding agents include Smad7-specific antibodies and other natural 
intracellular binding agents identified with assays such as two hybrid screens, and non-natural 
intracellular binding agents identified in screens of chemical libraries and the like. 

In general, the specificity of Smad7 binding to a binding agent is shown by binding 
equilibrium constants. Targets which are capable of selectively binding a Smad7 polypeptide 

10 preferably have binding equilibrium constants of at least about 10 7 M" 1 , more preferably at least 
about 10 8 M 1 , and most preferably at least about 10 9 M* 1 . The wide variety of cell based and cell 
free assays may be used to demonstrate Smad7-specific binding. Cell based assays include one, 
two and three hybrid screens, assays in which Smad7-mediated transcription is inhibited or 
increased, etc. Cell free assays include Smad7-protein binding assays, immunoassays, etc. 

15 Other assays useful for screening agents which bind Smad7 polypeptides include fluorescence 
resonance energy transfer (FRET), and electrophoretic mobility shift analysis (EMSA). 

Various techniques may be employed for introducing nucleic acids of the invention into 
cells, depending on whether the nucleic acids are introduced in vitro or in vivo in a host. Such 
techniques include transfection of nucleic acid-CaP0 4 precipitates, transfection of nucleic acids 

20 associated with DEAE, transfection with a retrovirus including the nucleic acid of interest, 

liposome mediated transfection, and the like. For certain uses, it is preferred to target the nucleic 
acid to particular cells. In such instances, a vehicle used for delivering a nucleic acid of the 
invention into a cell (e.g., a retrovirus, or other virus; a liposome) can have a targeting molecule 
attached thereto. For example, a molecule such as an antibody specific for a surface membrane 

25 protein on the target cell or a ligand for a receptor on the target cell can be bound to or 
incorporated within the nucleic acid delivery vehicle. For example, where liposomes are 
employed to deliver the nucleic acids of the invention, proteins which bind to a surface 
membrane protein associated with endocytosis may be incorporated into the liposome 
formulation for targeting and/or to facilitate uptake. Such proteins include capsid proteins or 

30 fragments thereof tropic for a particular cell type, antibodies for proteins which undergo 

internalization in cycling, proteins that target intracellular localization and enhance intracellular 
half life, and the like. Polymeric delivery systems also have been used successfully to deliver 
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nucleic acids into cells, as is known by those skilled in the art. Such systems even permit oral 
delivery of nucleic acids. 

When administered, the therapeutic compositions of the present invention are 
administered in pharmaceutically acceptable preparations. Such preparations may routinely 
5 contain pharmaceutically acceptable concentrations of salt, buffering agents, preservatives, 

compatible carriers, supplementary immune potentiating agents such as adjuvants and cytokines 
and optionally other therapeutic agents. 

The therapeutics of the invention can be administered by any conventional route, 
including injection or by gradual infusion over time. The administration may, for example, be 

10 oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. 
When antibodies are used therapeutically, a preferred route of administration is by pulmonary 
aerosol. Techniques for preparing aerosol delivery systems containing antibodies are well 
known to those of skill in the art. Generally, such systems should utilize components which will 
not significantly impair the biological properties of the antibodies, such as the paratope binding 

15 capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington's Pharmaceutical 

Sciences , 18th edition, 1990, pp 1694-1712; incorporated by reference). Those of skill in the art 
can readily determine the various parameters and conditions for producing antibody aerosols 
without resort to undue experimentation. When using antisense preparations of the invention, 
slow intravenous administration is preferred. 

20 The compositions of the invention are administered in effective amounts. An "effective 

amount" is that amount of a composition that alone, or together with further doses, produces the 
desired response, e.g. alters favorably the signal transduction resulting from binding of TGF-P, 
activin, BMP and/or Vgl (Vgr-1) to specific receptors. In the case of treating a particular 
disease, such as cancer, the desired response is inhibiting the progression of the disease. This 

25 may involve only slowing the progression of the disease temporarily, although more preferably, 
it involves halting the progression of the disease permanently. This can be monitored by routine 
methods or can be monitored according to diagnostic methods of the invention discussed herein. 

Such amounts will depend, of course, on the particular condition being treated, the 
severity of the condition, the individual patient parameters including age, physical condition, size 

30 and weight, the duration of the treatment, the nature of concurrent therapy (if any), the specific 
route of administration and like factors within the knowledge and expertise of the health 
practioner. These factors are well known to those of ordinary skill in the art and can be 
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addressed with no more than routine experimentation. It is generally preferred that a maximum 
dose of the individual components or combinations thereof be used, that is, the highest safe dose 
according to sound medical judgment. It will be understood by those of ordinary skill in the art, 
however, that a patient may insist upon a lower dose or tolerable dose for medical reasons, 
5 psychological reasons or for virtually any other reasons. 

The pharmaceutical compositions used in the foregoing methods preferably are sterile 
and contain an effective amount of Smad7 or nucleic acid encoding Smad7 for producing the 
desired response in a unit of weight or volume suitable for administration to a patient. The 
response can, for example, be measured by determining the signal transduction enhanced or 

10 inhibited by the Smad7 composition via a reporter system as described herein, by measuring 
downstream effects such as gene expression, or by measuring the physiological effects of the 
Smad7 composition, such as regression of a tumor or decrease of disease symptoms. Likewise, 
the effects of antisense Smad6 and Smad7 can be readily determined by measuring expression of 
the individual genes in cells to which an antisense composition is added. Other assays will be 

15 known to one of ordinary skill in the art and can be employed for measuring the level of the 
response. 

The doses of Smad7 polypeptide or nucleic acid administered to a subject can be chosen 
in accordance with different parameters, in particular in accordance with the mode of 
administration used and the state of the subject. Other factors include the desired period of 

20 treatment. In the event that a response in a subject is insufficient at the initial doses applied, 
higher doses (or effectively higher doses by a different, more localized delivery route) may be 
employed to the extent that patient tolerance permits. 

In general, doses of Smad7 are formulated and administered in doses between 1 ng and 
1 mg, and preferably between 10 ng and 100 ^g, according to any standard procedure in the art. 

25 Where nucleic acids encoding Smad7 of variants thereof are employed, doses of between 1 ng 
and 0.1 mg generally will be formulated and administered according to standard procedures. 
Other protocols for the administration of Smad7 compositions will be known to one of ordinary 
skill in the art, in which the dose amount, schedule of injections, sites of injections, mode of 
administration (e.g., intra-tumoral) and the like vary from the foregoing. Administration of 

30 Smad7 compositions to mammals other than humans, e.g. for testing purposes or veterinary 
therapeutic purposes, is carried out under substantially the same conditions as described above. 
When administered, the pharmaceutical preparations of the invention are applied in 
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pharmaceutically-acceptable amounts and in pharmaceutically-acceptable compositions. The 
term M pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredients. Such preparations may 
routinely contain salts, buffering agents, preservatives, compatible carriers, and optionally 
5 other therapeutic agents. When used in medicine, the salts should be pharmaceutical^ 
acceptable, but non-pharmaceutically acceptable salts may conveniently be used to prepare 
pharmaceutically-acceptable salts thereof and are not excluded from the scope of the invention. 
Such pharmacologically and pharmaceutically-acceptable salts include, but are not limited to, 
those prepared from the following acids: hydrochloric, hydrobromic, sulfuric, nitric, 

10 phosphoric, maleic, acetic, salicylic, citric, formic, malonic, succinic, and the like. Also, 

pharmaceutically-acceptable salts can be prepared as alkaline metal or alkaline earth salts, such 
as sodium, potassium or calcium salts. 

Smad7 may be combined, if desired, with a pharmaceutically-acceptable carrier. The 
term "pharmaceutically-acceptable carrier" as used herein means one or more compatible solid 

15 or liquid fillers, diluents or encapsulating substances which are suitable for administration into 
a human. The term "carrier" denotes an organic or inorganic ingredient, natural or synthetic, 
with which the active ingredient is combined to facilitate the application. The components of 
the pharmaceutical compositions also are capable of being co-mingled with the molecules of 
the present invention, and with each other, in a manner such that there is no interaction which 

20 would substantially impair the desired pharmaceutical efficacy. 

The pharmaceutical compositions may contain suitable buffering agents, including: 
acetic acid in a salt; citric acid in a salt; boric acid in a salt; and phosphoric acid in a salt. 

The pharmaceutical compositions also may contain, optionally, suitable preservatives, 
such as: benzalkonium chloride; chlorobutanol; parabens and thimerosal. 

25 The pharmaceutical compositions may conveniently be presented in unit dosage form 

and may be prepared by any of the methods well-known in the art of pharmacy. All methods 
include the step of bringing the active agent into association with a carrier which constitutes 
one or more accessory ingredients. In general, the compositions are prepared by uniformly 
and intimately bringing the active compound into association with a liquid carrier, a finely 

30 divided solid carrier, or both, and then, if necessary, shaping the product. 

Compositions suitable for oral administration may be presented as discrete units, such 
as capsules, tablets, lozenges, each containing a predetermined amount of the active 
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compound. Other compositions include suspensions in aqueous liquids or non-aqueous liquids 
such as a syrup, elixir or an emulsion. 

Compositions suitable for parenteral administration conveniently comprise a sterile 
aqueous or non-aqueous preparation of Smad7 polypeptides or nucleic acids, which is 
5 preferably isotonic with the blood of the recipient. This preparation may be formulated 
according to known methods using suitable dispersing or wetting agents and suspending 
agents. The sterile injectable preparation also may be a sterile injectable solution or 
suspension in a non-toxic parenterally-acceptable diluent or solvent, for example, as a solution 
in 1,3-butane diol. Among the acceptable vehicles and solvents that may be employed are 

10 water, Ringer's solution, and isotonic sodium chloride solution. In addition, sterile, fixed oils 
are conventionally employed as a solvent or suspending medium. For this purpose any bland 
fixed oil may be employed including synthetic mono-or di-glycerides. In addition, fatty acids 
such as oleic acid may be used in the preparation of injectables. Carrier formulation suitable 
for oral, subcutaneous, intravenous, intramuscular, etc. administrations can be found in 

15 Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA. 

In another aspect of the invention, Smad7 polypeptides or nucleic acid are used in the 
manufacture of a medicament for modulating a TGF-p, activin, BMP or Vgl response. The 
medicament can be placed in a vial and be incorporated into a kit to be used for increasing a 
subject's response to one or more of the above TGF-p family members. In certain embodiments, 

20 other medicaments which modulate the same responses or which favorably affect the Smad7 
compositions can also be included in the same kit. The kits can include instructions or other 
printed material on how to administer the Smad7 compositions and any other components of the 
kit. 

25 Examples 
Methods 

Isolation of mSmad7 and hSmad7 cDNA and Northern blot analysis. cDNA encoding the 
complete mSmad7 was made by fusing mouse EST cDNA (AA061644) with a partial cDNA 
30 isolated from a mouse placenta library. The cDNA for hSmad7 was isolated by screening a 
human brain cDNA library. cDNAs were sequenced on both strands using an ABI3 10 Genetic 
Analyzer. Isolation of total RNA, Northern blot analysis was performed as described before 
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(Afrakhte, M. et al. ? Int. J. Cancer 68, 802-809 (1996) using QUIKHYB® hybridization solution 
from Stratagene (La Jolla, CA). 

Expression Plasmids. Expression constructs for TpR-I, T0R-II, BMPR-IA, BMPR-IB, ActR-I, 
5 ActR-IB, Smadl, Smad2, Smad3 and Smad5 have been described previously (Nakao et al., 
EMBO J, 16:5353-5362, 1997). Other Smad or receptor plasmids have been previously 
described or were prepared according to standard protocols. F-Smad7 and F-Smad8 were made 
by a PCR-directed approach and subcloning into pcDNA-Flag. N-terminal and C-terminal 
Smad7 expression vectors were prepared using PCR amplification with specific primers and 

10 Smad7 cDNA as template. Expression constructs for the C-terminal domain of F-Smad7 (7C; 
amino acids 204-426), F-Smad7C with C-tail deletion (7CA; amino acids 204-207), Smad7 with 
deletion of C-tail (7A; amino acids 1-407), the N-terminal domain of Smad7 (7N; amino acids 1- 
203), mouse F-Smad6 "long version" (F-Smad6L)(Imamura et al., 1997) and human F-Smad6 
"short version" (F-Smad6S) (Topper et al., 1997), were made by a PCR-directed approach and 

15 were subcloned into pcDNA3-Flag. The resulting expression plasmids expressed N-terminal and 
C-terminal Smad domains tagged at their N-termini with the Flag epitope. Alternatively, the 
Smads and Smad fragments were subcloned into pMEP4 for inducible expression of the proteins. 
Anti-sense expression constructs for Smad molecules including mSmad7 and hSmad3 were 
prepared by cloning the complete Smad coding regions in reverse direction in pcDNA3 

20 expression vector (Invitrogen, Carlsbad, CA). 



Cell Assays. Transient transfection, metabolic labeling, immunoprecipitation, 
[ 32 P]orthophosphate labeling of cells, and SDS-PAGE were performed as described previously 
(Nakao et al., 1997). 

25 

Iodination of Ligand and Affinity Cross-linking. Iodination of TGF-pl and affinity cross- 
linking followed by immunoprecipitation were performed as previously described (Nakao et al., 
1997). TGF-pi and BMP-7 were iodinated using the chloramine-T method according to Frolik 
et al (J. Biol. Chem. 259:10995-1 1000, 1984). Cross-linking was performed as previously 
30 described (Nakao et al., 1997). In some cases, incubation with 125 I-TGF-pi was performed at 
room temperature for 2 h (Souchelnytskyi et al., J. Biol Chem. 272:27678-27685, 1997) and the 
transfected cells lines were stimulated for 24 h with 100/uM/ml zinc chloride to induce the 
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expression of Smad7. Complexes of Smads and affinity-labeled receptors were 
immunoprecipitated with antiserum directed against epitope tags in Smads. To determine 
expression levels of receptors, aliquots of cell lysates were immunoprecipitated with antisera 
against type I receptors. Expression of Smads was determined by Western blotting on aliquots of 
5 cell lysates. 



Transcriptional response assay. MvlLu wild-type and R mutant cells were transiently 
transfected with p3TPLux using DEAE-dextran transfection method, as described before (Nakao 
et al., 1997). HaCat cells were transiently transfected with p21 reporter plasmid (Datto et al., J. 

10 Biol Chem. 270: 28623-28628, 1995) using Transfectam reagent from Promega (Madison, WI). 
In each experiment, equal total amounts of DNA were transfected. In other experiments, MvlLu 
cells were transfected with p3TPLux and anti-sense expression constructs using Lipofectin; after 
incubation overnight in Optimum medium, the medium was changed to serum-containing 
medium. After another 24 h or incubation the cells were stimulated with TGF-pi and luciferase 

15 activity was measured. Luciferase activity was measured as previously described (Nakao et al., 
1997). The values were normalized for transfection efficiency using the [J-gal reporter gene 
under transcriptional control of CMV promoter (pCMV5 vector). Results shown are 
representative of at least three or four independent experiments. 

Transient transfections of COS cells were performed using the DEAE-dextran protocol. 

20 Stable transfection of MvlLu cells with pMEP4 expression vector (Invitrogen) was done using 
calcium phosphate precipitation method, as previously described (Souchelnytskyi et aL, 1997); 
selection was performed with 420 units/ml hygromycin. Induction with zinc chloride was done 
with lOOyuM zinc chloride for 20 h, unless indicated otherwise. Metabolic labeling of cells, 
[ 32 P]orthophosphate labeling, and immunoprecipitation, and SDS-PAGE were performed as 

25 earlier described (Souchelnyskyi et al., 1997 or Nakao et al., 1997). 



Xenopus embryo culture and manipulation. Xenopus eggs were obtained and embryos 
microinjected and cultured as described (Moon and Christian, Technique 1 : 76-89, 1989). For 
animal cap assays, 200 pg of RNA encoding activin-p or a bone morphogenetic protein- Vgl 
30 chimera (Dale et aL, EMBOJ. 12: 4471-4480, 1993) was injected either alone, or together with 
400 pg of Smad7 RNA. For Smad7/Smad8 experiments, RNA was synthesized by in vitro 
transcription (Moon and Christian, 1989) of pCS2+Smad8 and pCS2+Smad7 (Nakayama et al., 
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Development 125:857-867, 1998). The latter construct was generated by subcloning the coding 
region of F-Smad7 into pCS2+ (Turner and Weintraub, Genes Dev. 8:1434-1447, 1994). 
Embryonic stages are according to Nieuwkoop and Faber {Normal table ofXenopus laevis, 
Garland Publishing, Amsterdam, North Holland, 1967). Immunostaining of whole embryos 
5 (12/101 antibody was obtained from the Developmental Studies Hybridoma Bank under contract 
N01-HD-2-3144 from the NICHD, Tor 70 was obtained from R. Harland), whole mount in situ 
hybridization, RT-PCR analysis of RNA extracted from cultured animal caps, and quantitation of 
brachyury expression relative to that of EF-la were performed as described (Lagna et aL, Nature 
383: 832-836, 1996; Cui et aL, Dev. Biol 180: 22-34, 1996; Moon and Christian, 1989). 

10 

Cell Lines. COS cells and MvlLu mink lung epithelial cells and HaCat cells (human 
keratinocytes) were obtained from American Type Culture Collection. Cells were cultured in 
Dulbecco's modified Eagle's medium (Life Technologies, Inc., Gaithersburg, MD) 
supplemented with 10% fetal bovine serum (FBS) and antibiotics (100 units of penicillin and 50 

15 jag of streptomycin per ml). All the 10 different human lung carcinomas used [small cell lung 
carcinoma (SCLC), U-1285, U-1690, H-69 and H-92; non-SCLC, U-1752; squamous cell 
carcinoma (SQC); U-1810; large-cell carcinoma (LCC), H-157, H-661; adenocarcinoma (ADC) 
H-23 and H-125 (Histological typing of lung tumors, according to WHO, Geneva)] were grown 
in RPMI 1640 medium with 10% FBS and antibiotics. (Heldin et aL, Br. J. Cancer, 68: 708- 

20 711, 1993). 

RNA Isolation and Northern Blot Analysis. Cells were kept in 0.5% FBS 12-24 h prior to 
stimulation with various factors followed by RNA extraction. Lung carcinoma cells used for 
RNA extraction were grown in the presence of 10% FCS. Isolation of total RNA and Northern 

25 blotting was performed essentially as described (Afrakhte et al., Int. J. Cancer, 68:802-809, 
1996). The RNA extracted from the lung carcinomas cell lines was poly(A) + -enriched. 
Hybridizations were performed using QuickHyb buffer from Stratagene. The intactness and total 
amount of RNA (12 (ag or total RNA or 5 |ig of poly(A) + RNA per lane) were checked by 
staining the gel with ethidium bromide and by hybridization with a glyceraldehyde 3 -phosphate 

30 dehydrogenase (GAPDH) probe. The cDNA probes used in the hybridizations were a 1 .8 kb 

Eco/Xho I human Smad7 fragment, a 2.0 kb EcoKL fragment of the human Smad6, a 3 kb Eco RI 
fragment of the human PAI-1, and a 1 .5 kb mouse JunB fragment. For the GAPDH 
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hybridizations the entire human GAPDH cDNA plasmid was labeled. 
Antibodies 

Antibodies against Smad2, termed DQQ, have been previously described (Nakao et al., 
5 1997). Anti-Flag antibody was purchased from Kodak (New Haven). Specific antisera for 
Smad6, termed ESP and SRQ were raised against ESPPPPYSRLSPRDEYKPLD (SEQ ID 
NO:l 1) and SRQFITSCPCWLEILNPR (SEQ ID NO:12), respectively. Specific antisera raised 
against Smad7, termed KER and KAV, were raised against the synthetic peptides 
KERQLELLLQAVESRGGTRTA (SEQ ID NO: 13) and KAVRGAKGHHHPHPP (SEQ ID 
10 NO: 14), respectively. The peptides were coupled to keyhole limpet haEmocyanin (Calbiochem- 
Behring) with glutaraldehyde, mixed with Freund's adjuvant, and used to immunize rabbits 
according to standard procedures. 

Growth Inhibition assay 

MvlLu cells were seeded at 1X1 0 4 cells/well in 24-well plates. Prior to addition of 
TGF-pl the MvlLu cells were simultaneously treated (or not) with 100 fuM zinc chloride and 
indicated concentrations of TGF-pi for 20h to induce Smad7 expression. Before harvesting, the 
cells were pulsed with o.2 ^uCi of [ 3 H] thymidine (6.7 Ci/mmol, Amersham, UK) for 2h. The 
cells were fixed with ice-cold 5% trichloroacetic acid (TCA) for more than 20 min, washed twice 
with 5% TCA and once with water. Solubilization of the cells was done with 400//1 of 0.1 M 
NaOH for 20 min at room temperature. The 3 H-radioactivity incorporated into DNA was 
determined by liquid scintillation counting. 

Cells were grown in LAB TEK chambers (Nunc, Naperville, IL), and incubated with 
25 DMEM containing 0.3% FCS in the absence or presence of TGF-pl for 2h. The slides were 

washed once with phosphate buffered saline (PBS), fixed for 10 min with 4% paraformaldehyde, 
followed by three washes with PBS, subsequently permeabilized with 0.1% Triton X-100 in PBS 
for 5 min, and again washed three times with PBS. Slides were blocked by 10% goat serum for 
lh in room temperature, then incubated with 10% goat serum with anti-FLAG antibody 
30 (20/zg/ml) for 15 h at 4°C. The slides were subsequently washed 3 times, incubated with 
TRITC-conjugated goat anti-mouse IgG antibody (diluted 1 :40) and washed again 4 times. 
Nuclei were stained with DAPI (l//g/ml) for 10 min in room temperature followed by three 



15 



20 
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washes. In order to visualize the fluorescence, a Zeiss microscope, or laser confocal microscope 
was used. For cell counting a square lattice mounted in one of the eyepieces was used. In the 
COS cell experiments 200 cells were counted in 5 different field and the nuclear localization was 
checked by DAPI staining. 

5 

Example 1: Cloning and characterization of Smad7 

A database search for mammalian sequences related to Smads revealed the existence of 
two mouse expression sequence tags (ESTs), i.e. AA022262 and AA061644, with sequence 
similarity to N- and C-terminal Smad domains, respectively. Polymerase chain reaction (PCR) 

10 using kidney cDNA as a template with a PCR primer pair derived from both ESTs resulted in a 
specific product. A cDNA encoding the complete mouse protein was made by fusing mouse 
EST cDNA with a cDNA isolated from a mouse placenta library, and was termed mouse Smad7 
(mSmad7). Human Smad7 cDNA (hSmad7) was isolated from a human brain cDNA library. 
The cDNA sequences predict that mSmad7 and hSmad7 have 426 amino acid residues with 98% 

15 identity (Fig. la). 

All differences between hSmad7 and mSmad7 are found in the N-terminal domain. The 
borders of the Mad homology 2 (MH2) domain are indicated by arrows. The nucleotide 
sequences are deposited in European Molecular Biology Laboratory/GenBank data library 
(mouse and human Smad7 accession numbers are AF0152260 and AF015261, respectively). 

20 Smad7 is most related to Smad6 (Imamura et al., Nature, 389:622-626, 1997) with 36% and 56% 
sequence identities in the N-terminal domain and the C-terminal MH2 domain, respectively. The 
Smad7 N-terminal domain, which contains a glycine/glutamic acid residue-rich region, shows 
only weak similarity (approximately 15%) to the MH1 domains found in Smadl through Smad5. 
RNA blot analysis of various tissues with a probe from the region coding for the N- 

25 terminal domain of Smad7 revealed one major transcript of approximately 4.4kb (Fig. lb). 
Among the tissues analyzed, the highest expression of Smad7 was found in the lung. 

Example 2: Smad7 modulates TGF-P superfamily signal transduction 

To investigate whether Smad7 modulates the responsiveness to TGF-p, the TGF-P- 
30 inducible luciferase p3TPLux reporter construct, which contains the TGF-p-inducible PAI-1 

promoter, was transfected into MvlLu mink epithelial cells in the absence or presence of Smad7 
cDNA. Smad7 was found to exert a dose-dependent inhibition TGF-P-induce luciferase activity 



WO 98/53068 PCTYUS98/10305 

-44- 

(Fig. 2a). Moreover, the induction of p3TPLux luciferase by a constitutively active variant of 
TpR-I, when transfected in R-mutant cells, was also inhibited by cotransfection with Smad7, as 
was the response by a constitutively active variant of the structurally related type I receptor for 
activin (ActR-IB) (Fig. 2B). Transfection of Smad2 did not affect TGF-p 1 -induced p3TPLux 
5 luciferase response in MvlLu cells (Fig. 2a). This inhibitory effect was specific as Smad7 did 
not inhibit the phorbol 12-myristate 13 -acetate (PMA)/epidermal growth factor-induced 
p3TPLux luciferase response. In addition, the forskolin-mediated transcriptional induction using 
a cAMP-responsive-element-containing reporter construct was not affected by Smad7. These 
results indicate that Smad7 is a potent negative regulator of both TpR-I- and ActR-IB-induced 

10 p3 TPLux response . 

To examine the effect of Smad7 expression on TGF-p-mediated growth inhibition, a 
luciferase transcriptional reporter construct containing the p21 CDK inhibitor promoter (p21Lux) 
that is induced by TGF-P in human keratinocytes (HaCat) (Imamura, 1997) was used. Smad7 
was found to antagonize TGF-p 1 -mediated p21Lux response in HaCat cells dose-dependently 

15 (Fig. 2c). Increased expression of Smad2 by transfection slightly stimulated this response, 
whereas Smadl transfection had no effect (Fig. 2c). Smad7 and N-terminally Flag-tagged 
Smad7 gave essentially the same antagonistic results on TGF-p signaling, suggesting that N- 
terminal tagging does not interfere with functional properties of Smad7. Thus, Smad7 inhibits 
TGF-P-induced pathways leading to extracellular matrix production as well as growth inhibition. 

20 To determine whether Smad7 can inhibit transduction of TGF-p/activin-like signals in 

vivo, patterning defects caused by overexpression of Smad7 in Xenopus embryos were analyzed. 
When the endogenous activin signaling pathway is inactivated in early Xenopus embryos, by 
introduction of dominant negative forms of either an activin receptor (Hemmati-Brivanlou et al., 
Nature 359: 606-614, 1992; Chang et al., Development 124: 827-837, 1997) or of Smad4 (Lagna 

25 et aL, Nature 383: 832-836, 1996), mesoderm fails to form. Similarly, microinjection of RNA 
encoding Smad7 into both blastomeres of two-cell embryos inhibited mesoderm formation (Fig. 
2d-g). Five hundred picograms of synthetic RNA encoding either Smad7 or myc-epitope tag 
(MT) was injected into each blastomere of two-cell embryos as shown schematically (Fig. 2d) 
and embryos were cultured to the tailbud stage. Embryos injected with MT RNA developed 

30 normally (top embryo in each panel) whereas embryos made to misexpress Smad7 (bottom 

embryo in each panel) failed to form head or tail structures (Fig. 2e) and showed a complete or 
partial loss of muscle (Fig. 2f) and notochord (Fig. 2g). Arrows indicate immunoreactive muscle 
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(Fig. 2f) and notochord (Fig. 2g) in MT-injected embryos. Specifically, head and tail structures 
were absent or severely deficient in 80% of injected embryos (n=168) (Fig. 2e) as were 
mesodermal derivatives such as muscle (n=60) (Fig. 2f) and notochord (n=48) (Fig. 2g). 

To further test the ability of Smad7 to block mesoderm formation in vivo, embryos were 
5 analyzed for expression of brachyury , a gene that is expressed throughout the presumptive 
mesoderm during gastrulation (Smith et al., Cell 67:79-87,1991) (Fig. 2h). Five hundred 
picograms of synthetic RNA encoding MT (Fig. 2h) or Smad7 (Fig. 2i) was injected into one 
blastomere of two-cell embryos near the equator and expression of brachyury analyzed by whole 
mount in situ hybridization. MT-injected embryos show the typical ring of brachyury expression 

10 (Fig. 2h) while brachyury transcripts are not detected on one side of Smad7-injected embryos 
(Fig. 2i, arrow). As shown in Fig. 2i, injection of Smad7 RNA into the equatorial region of one 
blastomere of two-cell embryos prevented brachyury expression on one side of the embryo. 

Two members of the TGF-p family, activin and Vgl, are candidates for being 
endogenous mesoderm- inducing molecules (Kessler et al., Science 266: 596-604, 1994). A 

15 Xenopus animal cap assay (Kessler, 1994) was usedto directly test the ability of Smad7 to block 
signaling downstream of these ligands. While both activin and Vgl induced expression of 
brachyury in ectodermal explants (animal caps), coexpression of Smad7 inhibited brachyury 
induction of 60%, thereby demonstrating that Smad7 can block activin and Vgl -mediated 
mesoderm induction. 

20 To obtain insight into the mechanism by which Smad7 exerts its negative role in 

signaling by TGF-p family members, it was tested whether Smad7, like Smad2 and Smad3 
(Zhang et al., Nature 383:168-172, 1996; Macias-Silva et al., Cell 87:1215-1224, 1996; Nakao et 
al., 1997), can associate with the TGF-p receptor complex. COS cells transfected with 
N-terminally Flag-tagged Smad7 in combination with TpR-II (wild-type, WT, or kinase- 

25 deficient mutant, KD) and TpR-I (wild-type or kinase deficient mutant) were affinity labeled 
with 125 I-TGF-pl, and cell lysates were subjected to immunoprecipitation with Flag antiserum 
against the Flag-epitope in Smad7. Immunoprecipitates were analyzed by SDS-PAGE and FujiX 
Bio-Imager. Expression of receptors and F-Smad7 after transfection was determined by 
immunoprecipitation with VPN antibody against TpR-I or DRL antibody against TpR-II on 

30 aliquots of cell lysates, and by immunoprecipitation with Flag antibody on lysates [ 35 S] 

methionine/cysteine labeled transfected cells. It was found that Smad7 interacted very efficiently 
with the TGF-P receptor complex (Fig. 3). Smad7 interacted with wild-type TpR-I as well as 
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kinase inactive TpR-I in complex with wild-type TpR-II. In contrast, Smad2 and Smad3 interact 
stably only with complex of kinase-deficient T(JR-I and wild-type TPR-II (Macias-Silva et al., 
1996; Nakao et al., 1997). No interaction was observed between Smad7 and a heteromeric 
complex of kinase-deficient TpR-II and TpR-I, or with TpR-II alone (Fig. 3). Thus, 
5 transphosphorylation of TpR-I by TpR-II kinase is required for association of Smad7 with 
TpR-I. 



Example 3: Phosphorylation of Smad7 

Phosphorylation of Smad2 and Smad3, but not Smad4, is induced after TGF-p 

10 stimulation (Eppert et al., Cell 86: 543-552, 1996; Zhang et al., 1996; Macias-Silva et al., 1996; 
Nakao et al., 1997; Lagna et al., 1996. It was investigated whether Smad7 is phosphorylated 
upon association with TGF-p receptors. COS cells were transfected with Smad7 (or for 
comparison with Smad2) in the absence or presence of receptors, labeled with 
[ 32 P]orthophosphate, and treated with TGF-p 1. Cell lysates were subjected to 

15 immunoprecipitation with antisera against Smads or epitope tags, and the level of Smad 

phosphorylation was determined. Expression of type I receptors and Smads was analyzed by 
immunoprecipitation on aliquots of cell lysates, which had been labeled with [ 35 S]- 
methionine/cysteine. The 32 P- or 35 S-radioactivity associated with Smad2 in (Fig. 4b) or Smad3 
in (Fig. 4d) were quantitated by using a FujiX Bio-Imager and the 32 P/ 35 S-ratio calculated (Fig. 

20 4c and e, respectively). Representative results of three independent experiments are shown. As 
expected, Smad2 phosphorylation was increased upon coexpression with receptors, and addition 
of TGF-p 1 led to a further increase of Smad2 phosphorylation. However, no (or only very weak) 
phosphorylation of Smad7 was observed in cells transfected with constitutively active forms of 
TpR-I and ActR-IB or in cells transfected with wild-type TpR-I and TpR-II after stimulation 

25 with ligand (Fig. 4a). Thus, in spite of its direct association with TpR-I, Smad7 is not a direct 
substrate for the TpR-I kinase. Notably, Smad7 lacks the conserved SS(M/V)S C-tail sequence 
motif (Fig. la), which in the case of Smad2 (Macias-Silva et al., 1996) and Smadl (Kretzschmar 
et al., Genes Dev. 1 1 : 984-995, 1997) are phosphorylated by type I receptor kinase. 



30 



Example 4: Smad7 reduces phosphorylation of Smad proteins 

Smad7 may exert its negative role in TGF-P signaling by interfering with activation of 
other Smads. Therefore it was examined whether Smad7 affects the phosphorylation of Smad2 
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and Smad3 using [ 32 P]orthophosphate-labeled COS cells transfected with TGF-P receptors and 
Smads. COS cells were transfected with F-Smad7 or Smad2 (Fig. 4a), Smad2 alone or together 
with Smad7 (Fig. 4b), and N-terminally Myc-tagged Smad3 (M-Smad3) alone or together with 
F-Smad7 (Fig. 4d) in the absence or presence of wild-type or constitutive active (c.a.) receptors, 
5 whereafter cells were labeled with [ 32 P]orthophosphate. C-terminally HA-tagged type I receptors 
were used. Interestingly, Smad7 abolished the TGF-P 1 -induced phosphorylation of Smad2 (Fig. 
4b,c) and Smad3 in dose-dependent manner (Figs. 4d,e). A Smad7 level twice as high as Smad3 
appeared sufficient to cause a significant decrease in TGF-P-induced receptor-mediated Smad3 
phosphorylation. As previously reported (Macias-Silva et al., 1996; Nakao et al., 1996), it was 

10 found that Smad2 and Smad3 become phosphorylated when overexpressed in COS cells. For 
Smad2 this receptor-independent phosphorylation does not occur at the SSMS motif in the C- 
terminal tail (Macias-Silva et al., 1996), and may possibly occur via non-receptor kinases. 
Smad7 appears not to block this receptor-independent phosphorylation. Since activation of 
Smad2 and Smad3 is essential for optimal TGF-p signaling (Zhang et al., 1996; Macias-Silva et 

15 al., 1996; Nakao et al, 1997; Lagna et al., 1996; Kretzschmar et al, 1997), the inhibition of their 
phosphorylation provides a mechanistic explanation for the antagonistic effect of Smad7. The 
association of Smad7 with TpR-I suggests that it may compete with Smad2 and Smad3 for 
receptor binding. In accordance with this notion, co-transfection of Smad2 or Smad3 with 
Smad7 reduced the antagonistic effect of Smad7 in a TGF-P 1 -induced p3TPLux assay. In 

20 addition, it was determined that Smad7 not only inhibits TGF-P and activin signaling, but also 
blocks BMP signaling. Using methodology described herein, it was determined that Smad7 
associated with BMPR-Is and inhibited phosphorylation of Smadl in cultured mammalian cells. 
In addition, injection of Smad7 RNA into ventral cells of Xenopus embryos phenocopies the 
effect of blocking the BMP signaling pathway, and led to formation of an incomplete secondary 

25 dorsal axis. 

Example 5: Regulation of Smad7 expression by TGF-p 

Regulation of Smad7 expression by signaling molecules may be used to effectively 
modulate TGF-P responses. Therefore, it was investigated whether the expression of Smad7, 
30 and for comparison Smad2, Smad3 and Smad4, were regulated by TGF-p 1 . Northern blots with 
20 /^g/lane total RNA from TGF-P 1 stimulated MvlLu cells, SW1736 human anaplastic thyroid 
carcinoma cells and HaCat cells probed with Smad7 revealed that Smad7 mRNA was rapidly 
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induced in response to lOng/ml TGF-P 1 stimulation (Fig. 5a,b). The amount of RNA loaded was 
checked by hybridization of filters to a glyceraldehyde-3 -phosphate dehydrogenase (GAPDH) 
cDNA probe. In MvlLu cells, Smad7 mRNA was induced 4-fold after 30 min of TGF-P 
stimulation. Expression of Smad2, Smad3 and Smad4 upon TGF-P 1 stimulation remained 
5 unchanged (Fig. 5a). Induction of Smad7 mRNA by TGF-P 1 was observed in the presence of 
20 /Ug/ml cycloheximide (CHX), indicating that de novo protein synthesis is not required for 
TGF-P 1 -mediated induction of Smad7. In fact, Smad7 mRNA was superinduced in the presence 
of TGF-p 1 and 20 /zg/ml CHX added 30 min. prior to TGF-pl (Fig. 5b). This is likely caused by 
increase in mRNA stability or loss of transcriptional repressors by CHX. Treatment with CHX 

10 alone only slightly increased the basal Smad7 mRNA level. In contrast, 10 nM PMA, which like 
TGF-pi stimulates PAI-1 expression and inhibits the growth of MvlLu cells, did not induce 
Smad7 expression (Fig. 5c). In MvlLu cells lacking functional TpR-I (R mutant), TGF-pl had 
no effect on Smad7 mRNA expression. Taken together these data indicate that Smad7 is an 
immediate-early response gene for TGF-P 1, and activated complexes of Smad2, Smad3 and 

15 Smad4 may directly and/or in combination with DNA binding cofactors act on the Smad7 
promoter. The intracellular antagonist Smad7 may thus act in a negative feedback loop to 
regulate the intensity or duration of the TGF-p signal. 

Example 6: Smad7 interacts with BMP receptor complexes and inhibits BMP-mediated 
20 signaling. 

To determine if Smad7 associates with the BMP receptor complex, COS cells were 
transfected with the BMP receptors BMPR-IA, BMPR-IB, or ActR-I (wild-type or kinase- 
deficient mutant) tagged at C-terminus with HA epitope, and BMPR-II (wild-type or kinase- 
deficient mutant), in the absence or presence of N-terminally Flag-tagged Smad7 (F-Smad7). 

25 The receptors were covalently affinity labeled with 125 I-BMP-7 5 and cell lysates were subjected to 
immunoprecipitation with Flag antibody to immunoprecipitate F-Smad7. Immunoprecipitates 
were analyzed by SDS-PAGE and FujiX Bio-Imager. Expression of receptors and F-Smad7 (not 
shown) after transfection was determined by immunoprecipitation with HA-antisera on aliquots 
of cell lysates, and by immunoprecipitation with Flag antibody on lysates from [ 35 S] 

30 methionine/cysteine labeled transfected cells. As shown in Figs. 6A-F, TGFsuperfamily 

receptors coimmunoprecipitated with F-Smad7. Therefore, Smad7 associated with BMPR-IA 
and BMPR-IB, and less efficiently with ActR-I receptors. As shown in Fig. 6B, no interaction 
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was observed between Smad7 and heteromeric complexes of a kinase-deficient type II receptor 
and a wild-type or a kinase-deficient type I receptor. Thus Smad7 only interacts with type I 
receptors which are phosphorylated by the type II receptor (i.e. activated). Among the tested 
type I receptors, Smad7 preferentially interacts with BMPR-IA, BMPR-IB, TpR-I and likely 
5 ActR-IB, and less efficiently with ActR-I. In addition, the effect of Smad7 on BMPR-IB- and 
ActR-I-mediated phosphorylation of Smadl or Smad5 was examined (Fig. 6 C-F). Fig. C shows 
that Smad 7 inhibits BMPR-IB- and ActR-I-mediated phosphorylation of Smadl. COS cells 
were transfected with F-Smadl alone or together with F-Smad7 in the presence of type I 
receptors (BMPR-IB or ActR-I) and BMPR-II in the absence or presence of BMP-7, whereafter 

10 cells were labeled with [ 32 P]orthophosphate and immunoprecipitated with Flag antisera. 

Expression of F-Smadl and F-Smad7 was determined by immunoblotting with Flag on aliquots 
of cell lysates. Fig 6D shows that the 32 P-radioactivity associated with Smadl in Fig. 6C as 
quantified by using a FujiX Bio-Imager and plotted. Fig. 6E shows that Smad7 inhibits BMPR- 
IB phosphorylation of SmadS. COS cells were transfected with F-Smad5 alone or together with 

15 F-Smad7, in the presence of BMPR-IB and BMPR-II; cells were incubated in the absence or 
presence of BMP-7, whereafter they were labeled with [ 32 P]orthophosphate and subjected to 
immujnoprecipitation with Flag antisera. Fig. 6F shows the 32 P -radioactivity associated with 
SmadS in Fig. 6E as quantified using a FujiX Bio-Imager and plotted. COS cells were 
transfected with cDNAs encoding BMP receptors and Smadl or Smad5 in the absence or 

20 presence of Smad7. The phosphorylation level of Smadl or SmadS was examined by 

[ 32 l']orthophosphate labeling and immunoprecipitation from cell lysates using Smad antisera. 
Smad7 potently inhibited BMPR-IB-mediated phosphorylation of Smadl and SmadS in a dose 
dependent manner (Fig. 6 C-F). In addition, Smad7 inhibited ActR-I-mediated phosphorylation 
of Smadl (Fig. 6 C, D). The specificity of the interaction based on the phosphorylation of the 

25 type I receptor by type II receptor kinase strongly suggests physiological importance of the 
interaction. Phosphorylation may induce a conformational change that is required for Smad7- 
type I receptor interaction. 

To determine if Smad7 inhibits BMP-mediated phosphorylation of pathway specific 
Smads, COS cells were transfected with F-Smadl or F-Smad5, alone or together with F-Smad7 

30 in the absence or presence of constitutive active (c.a.) BMPR-IB (Fig. 7A), or alone or together 
with F-Smad7, type I receptors and BMPR-II in the absence or presence of BMP-7 (Fig. 7C), 
whereafter cells were labeled with [ 32 P]-orthophosphate. The amount of transfected Smad7 is 
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indicated in Fig. 7C (0.1 or 1.0 microgram). Cell lysates were subjected to immunoprecipitation 
with antisera against Smads, and the level of phosphorylation determined. Expression of Smads 
and BMPR-IB was analyzed by immunoprecipitation with antisera directed towards epitope tags 
on aliquots of cell lysates, which had been labeled with [ 35 S] methionine/cysteine. In Fig. 7B, 
5 the 32 P- or 35 S -radioactivity associated with Smadl were quantitated by using FujiX Bio-Imager 
and the 32 P/ 32 S-ratio calculated. Fig. 7 shows that Smad7 potently inhibited phosphorylation of 
by BMPR-IB -mediated Smadl or Smad5 phosphorylation in a dose-dependent manner. 
Consistent with the differential receptor binding, BMPR-IB-mediated Smad phosphorylation was 
more efficiently inhibited by Smad7 than was ActR-I-mediated Smad phosphorylation. 

10 

Example 7: Analysis of Smad7 N-terminal and C-terminal domain function 

To determine if the C-terminal domain of Smad7 associates with the TGF-P receptor 
complex, COS cells were transfected with wild-type N-terminally tagged mouse Smad7 (F- 
mSmad7) or a N-terminally Flag tagged mouse Smad7 C-terminal domain (amino acids 204-426 

15 of mSmad7) in combination with wildtype TGF-P receptors. The receptors were covalently 

affinity labeled with 125 I-TGF-pl. Immunoprecipitates were analyzed by SDS-PAGE and FujiX 
Biolmager. Expression of receptors and Smad7 after transfection was determined by 
immunoprecipitation with VPN antisera against TpR-I on aliquots of cell lysates, and by 
immunoprecipitation with Flag antibody on cell lysates from 35 S methionine/cysteine labeled 

20 transfected cells. As shown in Fig. 8A, the TGF-p receptor complex coprecipitates with Smad7 
C-terminal domain. Therefore, the Smad7 C-terminal domain associates with TGF-P receptors. 
Next, the C-terminal domain was tested in an assay of TGF-P-induced p3TPLux response, as 
described above. As shown in Fig. 8B, the C-terminal domain inhibits, albeit less efficiently as 
wild-type Smad7, the TGF-P-induced p3TPLux response. Fig. 8B, panel 2 shows that 

25 transfection of Smad7C blocks TGF-p-induced p3TPLux transcriptional response, but less 
efficiently than will-type Smad7. 

The inhibitory effect of N- and C-terminal Smad7 domains and Smad8 on TGFP 
superfamily receptor responses was tested. Smad8 and Smad7 deletion constructs were tested for 
effects on TpR-I-mediated phosphorylation of Smad2 or Smad3. COS cells were transfected 

30 with TpR-I and TpR-II and indicated Smads. Phosphorylation of Smads was determined by a 
32 P orthophosphate labeling assay. Fig. 8C shows that Smad8 and N- and C-terminal (MH2) 
domains of Smad7 each inhibited TpR-I -mediated phosphorylation of Smad2 or Smad3. The 
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effect of the C-terminal domain was achieved at lower expression levels than the N-terminal 
domain. Consistent with the ability of the C-domain of Smad7 to inhibit TpR-I-mediated 
phosphorylation of Smad2, this domain was sufficient to inhibit TGF-P-induced transcriptional 
activation, although it was less efficient at doing so than was wild-type Smad7 (Fig. 8B, panel 2). 
5 Deletion of the last 19 amino acids residues completely abrogated the ability of the Smad7 C- 
domain to inhibit TGF-P-induced transcriptional activation of the 3TP promoter. 

Fig. 8D shows the inhibitory effect of Smad7 N- and C-terminal domains and Smad8 (as 
compared to Smad7 wild-type) on TGF-p receptor mediated p3TPLux transcriptional response. 
MvlLu cells were transfected with p3TP-Lux reporter construct with or without different 
10 quantities of inhibitory Smads. The induction of luciferase expression upon cell incubation with 
TGF-p was measured. Consistent with their differential phosphorylation inhibitory effects, 
Smad8 and the Smad7 domains inhibited the TGF-p mediated transcriptional response of the 
reporter gene. 

Fig. 8E shows that Smad7C inhibits TpR-I-mediated phosphorylation of Smad2, but less 
15 efficiently than wild-type Smad7. In the absence of inhibitory Smad, TGF-p 1 stimulated a 1.6- 
fold increase in Smad2 phosphorylation level, which was completely blocked by cotransfection 
with l//g wild-type Smad7, whereas in the presence of lywg Smad7C a 1.2-fold increase in TGF- 
P-mediated Smad2 phosphorylation was observed. COS cells were transfected with F-Smad2 
alone or together with F-Smad7C or wild-type F-Smad7 in the presence of BMPR-IB and 
20 BMPR-II; cells were then incubated in the absence or presence of BMP-7, whereafter they were 
labeled with [ 32 P]orthophosphate and subjected to immunoprecipitation with Flag antisera. 
Expression of F-Smad2, F-Smad7 and F-Smad7C was determined by immunoblotting with Flag 
on aliquots of cell ly sates. 

25 Example 8: Smad6 is Highly Conserved and Most Closely Related to Smad7 

A database search for human and mouse sequences related to Smad7 revealed the 
existence of an expressed sequence tag (EST; Genbank accession number N95582) 
corresponding to a novel human Smad. By screening of a human placenta cDNA library we 
obtained a 3.2 kb cDNA clone. The entire coding region of this clone was sequenced using ABI 
30 Prism 310 Genetic Analyzer, and sequence analysis was performed with DNASTAR. Sequence 
analysis revealed 92% amino acid similarity to mouse Smad6 (Imamura et aL, Nature, 389:622- 
626, 1997), and therefore this gene was termed human Smad6 (hSmad6) (Fig. 9A). Fig. 9A 
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shows sequence comparison of human and mouse Smad6 and human Smad7. Identical residues 
are boxed. The borders of the Mad-homology MH2 domain are indicated by arrows. The 
Genbank accession number for human Smad6 is AF043640. The sequence identity between 
human and mouse Smad6 is lower compared to that observed with other Smads, e.g., human and 
5 mouse Smad7 are 98% identical (Hayashi et al., 1997; Nakao et al., 1997). Fig. 9B shows 
pairwise alignment relationship between hSmadl through hSmad7 and hSmad9. Among the 
human Smads identified to date, hSmad6 is most closely related to hSmad7 (41% overall 
sequence identity). Like the N-terminal domain of Smad7, the N-terminal domain of Smad6 
shows very weak similarity to MH1 domains from Smadl through Smad5. Notably hSmad6 

10 lacks the conserved SS(M/V)S motif in its carboxy-terminal tail, which in the case of pathway- 
restricted Smads is phosphorylated by appropriate type I receptor kinases. Recently Hata et al. 
reported a hSmad6 sequence (accession number AF035528) (Hata et al., Genes Dev., 12:186- 
197, 1998). Comparison of the two sequences reveals one nucleotide difference in the coding 
region resulting in one amino acid difference (codon 21 in AF043640 is predicted to be an 

1 5 aspartic acid residue) . 

Example 9: Smad6 and Smad7 Expression in Lung Cancer Cell Lines 

As previously reported, the distribution of Smad6 mRNA in various human tissues 
revealed that Smad6 was broadly expressed in various human tissues. One Smad6 transcript of 

20 approximately 3 kb was detected (Imamura, T., et al. 1997). Interestingly, the expression 

profiles of Smad6 and Smad7 (Nakao et al., 1997) were very similar with the highest expression 
in the lung. Therefore, a panel of 10 different lung carcinoma cell lines (four SCLC and six non- 
SCLC) was examined for Smad6 and Smad7 mRNA expression (Fig. 9C). The highest Smad6 
mRNA expression was detected in the SCLC cell lines H-69 and H-82, and in the non-SCLC cell 

25 lines H-661 and H-23, whereas the SCLC cell line U-1690, H-69 and H-82, and the non-SCLC 
cell lines H-157 and H-125 expressed the highest levels of Smad7 mRNA. Thus, the two genes 
are differently expressed, and no correlation was observed between Smad6 and Smad7 
expression in these cells. 



30 



Example 10: TGF-P Family Members Induce Smad6 and Smad7 mRNA in MvlLu and 
HaCat Cells 

Mink lung epithelial (MvlLu) and human keratinocyte (HaCat) cell lines are responsive 
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to TGF-pi, activin and BMP-7 (Yashamita et al. J. Cell Biol 130:217-226, 1995). Smad7 
mRNA is rapidly included by TGF-pl (Nakao, A., 1997). Northern blot analysis of Smad6 and 
Smad7 expression was performed on RNA prepared from MvlLu and HaCat cells stimulated 
with TGF-pi (10 ng/ml), activin (50 ng/ml) and BMP-7 (500 ng/ml). Phosphor Imager 
5 quantitation revealed that (after 90 min of ligand stimulation and normalizing with GAPDH 
mRNA expression level) in MvlLu cells (Fig. 10A Panel 1), TGF-P 1, activin and BMP-7 
induced Smad6 mRNA expression 6-, 2- and 3-fold, respectively, and induced Smad7 mRNA 
expression 5-, 2-, and 3-fold, respectively, and in HaCat cells (Fig. 10A Panel 2), TGF-pl, 
activin and BMP-7 induced Smad6 mRNA expression 4-, 2- and 12-fold, respectively, and 

10 included Smad7 mRNA expression 6-, 3-, and 6-fold, respectively. Both genes were rapidly 
induced by all three ligands (Fig. 10A). Smad6 and Smad7 were induced with similar kinetics 
by TGF-p and activin, with a major peak of expression after 90 min of stimulation. A second 
peak of Smad6 and Smad7 mRNA expression was observed after 24 hr of stimulation MvlLu 
cells, in particular after TGF-pl stimulation (Fig. 10A). There was a difference in the BMP7- 

15 induced mRNA expression profiles for Smad6 and Smad7 in MvlLu cells: whereas Smad7 
mRNA expression peaked at 90 min and was reduced to low levels thereafter, Smad6 mRNA 
remained high after 90 min of stimulation (Fig. 10A). Treatment of cells with 0.5% FCS, prior 
to ligand treatment, was found to decrease the basal level of inhibitory Smad expression, making 
the mRNA inductions observed with ligands more pronounced. The expression of Smad6 and 

20 Smad7 mRNA after stimulation with ligands were similar in cultures of high and low cell 
density. 

As shown in the Examples above, Smad7 not only inhibits TGF-P and activin, but also 
BMP signaling. Smad7 associates with BMP type I receptors and inhibits BMPR-I-mediated 
signaling. Thus Smad7 may exert a negative feedback control on signaling by TGF-pl, activin 

25 as well as BMP-7. 

Example 11: Smad6 is a Direct Target Gene for TGF-pi 

Previously it was demonstrated that the induction of Smad7 mRNA by TGF-P 1 was 
observed in the presence of cycloheximide (CHX), indicating that de novo protein synthesis is 
not required for this response. Induction by TGF-P 1 of Smad6 mRNA was also seen after 

30 addition of CHX (Fig. 10B). CHX (20 jig/ml) was added 30 min before TGF-P 1. The induction 
of Smad6 mRNA was prolonged in the presence of both TGF-P 1 and CHX when compared to 
TGF-pl alone (Fig. 10B), probably as a result of CHX-induced loss of transcriptional repressors 
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or increase in Smad6 mRNA stability. Based upon these results, Smad6 and Smad7 transcription 
are likely to be directly regulated by pathway-restricted and common mediator Smads. 
Therefore, the effect of TGF-pl on MDA-MB-468 breast cancer cells that lack Smad4 (Lagna et 
al., Nature,, 383:832-836, 1996) was examined (Fig. 10C). Smad6 mRNA expression was not 
5 affected by TGF-P 1 in these cells. Smad7 expression was below detection level in the absence 
or presence of TGF-pl, or possibly the Smad7 gene may be absent in these cells. Taken 
together, the results indicate that Smad6 and Smad7 are immediate-early response genes for 
TGF-P family members. 

10 Example 12: Transfection of Anti-sense Expression Construct Enhances Cellular 
Responsiveness to TGF-pi 

To investigate whether the expression level of inhibitory Smads modulate the 
responsiveness to TGF-P, TGF-pl -induced transcriptional response was measured in MvlLu 
cells transfected with an anti-sense Smad7 cDNA expression construct. MvlLu cells that are 

15 grown in the presence of serum have relatively high constitutive Smad7 expression (Fig. 9C), 
and Smad7 is rapidly induced by TGF-P 1 (Fig. 10A). Transfection of anti-sense Smad7 
expression construct (mSmad7r), but not empty vector or anti-sense Smad3 expression construct 
(hSmad3r), increased TGF-P 1 -mediated p3TPLux transcriptional response in MvlLu cells. 
TGF-pl (10 ng/ml) was added 4 h prior to lysis of cells to measure luciferase activity. Data are 

20 presented as mean +/- SEM. Transfection of MvlLu cells with 20 \ig of anti-sense Smad7 
expression plasmid increased the TGF-pl response, measured as activation of the PAI-1 
promoter containing reporter, (p3TPlux) after a 4 h stimulation. The increased TGF-P response 
in the Smad7 anti-sense transfected cells was 2-4 fold compared to control cells transfected with 
empty expression vector (Fig. 11); the response was dependent on the amount of transfected anti- 

25 sense expression construct. Transfection with anti-sense Smad3 expression construct showed no 
or very weak inhibitory response (Fig. 11). These results indicate that the expression level of 
Smad7 determines the cellular responsiveness to TGF-pl. 

Example 13: Effect of EGF and PMA on Smad6 and Smad7 mRNA Expression 

30 To examine the possibility of cross-talk between different signaling pathways, the effects 

of forskolin (activator of cAMP), epidermal growth factor (EGF) and phorbol ester PMA 
responses were determined in the absence and presence of 1 0 ng/ml TGF-P 1 . Forskolin had no 
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effect alone, and only slightly enhanced the TGF-p 1 -induced expression of Smad6 and Smad7 
mRNA. EGF (10 ng/ml) by itself was capable of inducing Smad6 and Smad7 expression, and 
acted synergistically with TGF-p 1 for the induction of Smad7 mRNA (Fig. 12A). EGF was 
added 60 min. before TGF-P 1. PMA (10~ 8 M) alone had no appreciable effect, but a strong 
5 synergistic response was observed when PMA was added together with TGF-p 1 on the induction 
of Smad6 and Smad7 expression (Fig. 12B). In this particular experiment the TGF-p 1 -mediated 
induction of Smad6 and Smad7 mRNA was very low, but could be observed upon longer 
exposure to X-ray film, thereby making synergistic effect more apparent. PAI-1 gene 
expression, which was found to be more rapidly induced by TGF-P 1 (peak at 90 min) than PMA 
10 (peak at 4 h), was included as a control to show effect of PMA and TGF-p 1 on cells. 

Example 14: Smad7 is More Potent than Smad8 In Inhibiting TGF-P-induced Responses 

Xenopus Smad8 shares 96% amino acid identity with murine Smad7 within the C-domain 
(Nakayama et al, 1998). Given that this domain is sufficient for receptor binding, and for 

15 inhibition of several TGF-p-induced responses, one would predict that these proteins would 
behave identically with regard to receptor binding and inhibition of downstream signaling. 
Consistent with this possibility, Smad8 interacted with the activated TpR-I, BMPR-IA, BMPR- 
IB and ActR-I. In addition, Smad9 inhibited BMPRI -mediated phosphorylation of Smadl and 
SmadS in a dose-dependent fashion that was indistinguishable from that of murine Smad7. Fig 

20 13A shows that Smad8 inhibits T pR-I-mediated phosphorylation of Smad2 less efficiently than 
Smad7. Cells were transfected with F-Smad2 alone or together with F-smad7 or F-Smad8 in the 
presence of T pR-I and T PR-II; cells were then incubated in the absence or presence of TGF-p. 
The level of F-Smad2 phosphorylation was determined by [ 32 P]orthophosphate labeling and 
immunoprecipitation of cell lysates with Flag antisera. Expression of F-Smads was analysed by 

25 Western blotting with Flag antiserum on aliquots of cell lysates. Fig. 13B shows that 

transfection of Smad7 in MvlLu cells blocks the TGF-P-induced p3TPLux response more 
effectively than Smad8. Thus, while Smad7 efficiently inhibited TpR-I-mediated 
phosphorylation of Smad2, Smad8 was less effective at doing so (Fig. 13 A). Interestingly, this 
correlated with the more pronounced effect of Smad7 versus Smad8 in inhibiting TGF-P-induced 

30 transcriptional activation of the p3TPLux reporter gene (Fig. 13B). Of note, Smad7 is more 
effective than is Smad8 at inhibiting activin signaling in Xenopus mesoderm induction assay 
(Nakayama et al., 1998). 
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Example 15: Partial Secondary Axis Formation by Smad7 and Smad8 

To determine whether Smad7 can inhibit transduction of BMP signals in vivo patterning 
defects caused by overexpression of Smad7 in Xenopus embryos were analyzed. Fig. 14A shows 
photomicrographs of control tadpole and Fig. 14B shows sibling tadpole made to misexpress 
5 Smad7 in ventral cells. Note induction of partial secondary dorsal axis (arrow). Fig. 14C and 
Fig. 14D (control) depict Smad7-RNA injected sibling showing immunoreactive muscle in 
primary (arrow) and secondary (arrowhead) axes. Fig. 14E shows eye defects and spina bifida in 
an embryo made to overexpress Smad8 in dorsal cells. Fig. 14A shows fusion of eyes in an 
embryo made to overexpress Smad7 in dorsal cells. Overexpression of known BMP antagonists, 

10 such as dominant negative BMP receptors or ligands, on the ventral side of Xenopus embryos 
can induce the formation of a partial secondary axis (reviewed inGraff Cell 89:171-174, 1997). 
Injection of 200 or 400 pg of RNA encoding Smad7 near the ventral midline of four-cell 
embryos led to formation of a secondary dorsal axis in 90% (n=100) or 96% (n=79) of embryos, 
respectively (Fig. 14B). Secondary axis formation was observed in 94% (n=98) of embryos 

15 injected with 200 pg of RNA encoding that isolated C-domain of Smad7. Axes induced by 
either Smad7 or the C-domain of Smad7 contained immunoreactive muscle (Fig. 14D) but 
lacked notochord. In the same series of experiments, injection of 200 or 400 pg of RNA 
encoding Xenopus Smad8 near the ventral midline induced the formation of a secondary axis in 
87 (n=92) or 91% (n=101) of injected embryos, respectively. One notable difference between 

20 the axes induced by the two gene products is that the secondary axes induced by Smad8 included 
a cyclopic or fused eye in 10% (200 pg) or 41% (400 pg) of cases whereas eye formation was 
rarely observed (3% of embryos) following misexpression of Smad7, and then only when a high 
dose (400 pg) of RNA was injected. 

25 Example 16: Ectopic Expression of Smad7 in Dorsal Cells Induces a Subset of the 
Patterning Defects Caused by Overexpression of Smad8 

Ectopic expression of Smad8 within dorsal cells of Xenopus embryos produces a range of 
phenotypic defects which cannot be attributed to blockade of BMP signaling. These include 
failure of the neural folds to fuse (spina bifida) as well as a range of eye defects (Nakayama et 
30 al., 1998). To directly compare patterning defects cause by misexpression of Smad7 and Smad8 
within dorsal cells, 200 pg of RNA encoding either protein was injected near the dorsal midline 
of four-cell embryos. While 20% of Smad8 -injected embryos developed spina bifida, and 40% 
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showed a decrease in eye pigmentation (n=50), 97% of Smad7-injected embryos developed 
completely normally (n=89). When a higher dose (400 pg) of RNA encoding either Smad8 or 
Smad7 was injected, 40% of Smad8 -injected embryos developed with spina bifida (Fig. 14E, 
n=124) while only 7% of embryos injected with Smad7 RNA (n=140), and 3% of embryos 
5 injected with a control RNA (encoding a myc epitope tag, n=70) displayed this defect. In 
contrast, 43% of Smad7-injected embryos developed with variable eye defects ranging from a 
slight to moderate decrease in eye pigment to fusion of the eyes (Fig. 14F). Forty-seven per cent 
of Smad8-injected siblings developed with comparable eye defects (Nakayama et al., 1998). The 
percentage of Smad8 -injected embryos displaying spina bifida, eye induction or eye defects was 

10 somewhat lower in these experiments than that previously reported (Nakayama et al., 1998), 
possibly due to inherent differences between embryos used for the two sets of experiments. 
Together, these results suggest that Smad7 and Smad8 can interact with common signaling 
pathways in vivo although Smad8 may target additional, as of yet unidentified, pathways. 
Therefore, overexpression of Smad8 in Xenopus embryos produces patterning defects that are not 

15 observed following overexpression of Smad7, suggesting that Smad7 and Smad8 may 

preferentially target distinct signaling pathways. While it cannot be ruled out that the possibility 
that these differences are due to species specificity, e.g., the possibility that mammalian Smad7 is 
more efficiently translated, or interacts more efficiently with receptors in mammalian cells than 
does Xenopus Smad8, the results do not support this interpretation. Specifically, Smad7 is a 

20 more potent antagonist of TGF-p/activin signaling in both mammalian and Xenopus systems 
than is Smad8. 



Example 17: Smad7 but not Smad6 inhibits TGF-pi-induced growth inhibition. 

In order to investigate the effect of inhibitory Smads on TGF-pl -mediated growth 
25 inhibition, mouse Flag-tagged Smad6 (F-Smad6) and F-Smad7 were stably transfected into 
MvlLu cells; Smad6 and Smad7 were placed under transcriptional control of the human 
inducible metallothionien IIA promoter using the pMEP4 expression vector. Both reported 
forms of Smad6 were tested; a mouse long Smad6 version (Smad6L; Imamura et al., Nature 
398:622-626, 1997) as well as a human short Smad6 version (Smad6S; Topper et al., PNAS 
30 94:9314-9319, 1997). Fig. 15 shows characterization of Smad expression level in stable 

transfected MvlLu cell lines in the absence or presence of zinc chloride; wild-type F-Smad7 
(pMEP4-smad7; three independent clones, 7-3, 7-5S and 7-10), the N-terminal domain of Smad7 



WO 98/53068 PCT/US98/1 0305 

-58- 

(pMEP-7N, clone 7N-5), Smad7 with deletion at C-tail (pMEP4-7NCA, clone 7NCA-7), the C- 
terminal domain of Smad7 (pMEP4-7CA, clone 7CA-7) and the "short" variant of Smad6 
(pMEP4-Smad6S, clone S6-2)and the "long" variant of Smad6 (pMEP4-Smad6L, clone L6-3) 
expression results are shown. Cells were metabolically labeled, and cell ly sates subjected to 
5 immunoprecipitation with Flag antibody. Immunoprecipitates were analyzed by SDS-PAGE. 
When the expression of Smad6S, Smad6L and Smad7 was analyzed, we observed that the 
inhibitory Smads were already expressed in the absence of the inducer zinc chloride. This is 
result of leakage of the metallothionien promoter. Pretreatment of cells with zinc chloride in 
each transfectant led to increase in Smad6 and Smad7 expression (Fig. 15). Subsequently, we 

10 tested the response of the Smad7 and Smad7 (and empty pMEP4) transfectants for TGF-P 1- 
induced growth inhibition in the absence or presence of zinc chloride. Multiple independent 
Smad7-expressing clones were analyzed. We found that Smad7 prevented the TGF-pl -induced 
growth-inhibitory effects in MvlLu (Fig. 16A-C). In all Smad7 clones TGF-P 1 -induced growth 
inhibition was blocked to an extent which correlated with their different expression levels of 

15 Smad7. pMEP4-transfected MvlLu cells, in the absence or presence of zinc chloride showed a 
similar dose response curve for TGF-pi as nontransfected MvlLu cells. Ectopic expression 
Smad6L or Smad6S did not affect the TGF-pl -induced growth inhibition (Fig. 16D,E). Fig 16 
shows the effect of TGF-pl on pMEP4 transfected cells in the absence or presence of zinc 
chloride. The relative growth compared to control is plotted against the concentration of TGF- 

20 pi. Different lots of TGF-P 1 were used in experiments A-C versus D,E, explaining why there is 
a difference between the ED50's on pMEP4 transfected cells in these experiments. All data 
shown are means +/-SED. Thus Smad7 can antagonize TGF-pl -induced growth inhibition, and 
appears to be more effective than Smad6. 

25 Example 18: Smad7 inhibits TGF-pi-induced transcriptional responses. 

Subsequently we examined the effect of ectopic Smad7 expression on TGF-P 1 -induced 
expression of the endogenous early response genes, Smad7, PAI-I and JunB (Fig. 17). In Fig. 17 
it is shown that ectopically expressed Smad7 inhibits TGF-P 1 -mediated induction of JUNB (A), 
Smad7 (B) and PAI-1 mRNA (E) expression. Northern blot analysis on RNA from MvlLu cells 
30 stably transfected with pMEP4-Smad7 without or with pretreatment with zinc chloride on 
exposure to TGF-pl. The endogenous mRNAs for junB, Smad7 and PAI-1 are indicated by 
arrows. The asterisk in Smad7 blot indicates the transfected F-Smad7 mRNA. The amount and 
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intactness of total RNA loaded was checked by ethidium bromide staining (C and F). For junB 
(A) and Smad7 probe (B) the same blot was used, which was different from blot for PAI-1 1 
probe (E). After treatment of cells with zinc chloride for 20 h prior to TGF-pl stimulation for 2 
h, the ectopically induced Smad7 was found to inhibit TGF-pl -induced immediate early gene 

5 responses at a Smad7 mRNA extopic expression level similar to that of endogenous gene after 
TGF-p treatment (Fig. 17B). Interestingly, upon treatment with zinc chloride for 6h, no 
significant effect on TGF-P 1 stimulation of early response genes in the Smad7 expressing clones 
was observed, whereas the apparent ectopic Smad7 mRNA levels were higher after 6 h versus 
20h pretreatment. Possibly differences in Smad7 mRNA do not correlate exactly with Smad7 

10 protein levels upon zinc chloride treatment, or a particular post-translationally modification that 
occurs with slow kinetics may be required for interaction of Smad7 with receptor in MvlLu 
cells. 



Example 19: TGF-pl-induced nuclear export of Smad7 in transfected COS cells and 
15 nontransfected MvlLu cells 

The subcellular localization of F-Smad6, F-Smad7 and F-Smad2 in transfected COS cells 
was analyzed in the absence or presence of the constitutive active TpR-I(T204D) by 
immunofluorescence using Flag antibody (Fig. 18A). In Fig. 18, Panel A shows the subcellular 
distribution of F-Smad7, F-Smad7 and F-Smad2 in the absence or presence of constitutive TpR- 

20 I(T204D) mutant in transfected COS cells. Smads were localized in the cells by 

immunofluorescence using Flag antibody. Panel B shows the quantitation of nuclear versus 
cytoplasmic cell staining for F-smads in the absence or presence of T pR-I(T204D). Panel C 
shows the subcellular distribution of Smad7 and Smad6 in nontransfected MvlLu cells in the 
absence and presence of TGF-pl . The Smads were localized in the cells using specific antisera 

25 to Smad6 (ESP) or Smad7 (KER). Smad6 was mainly located in the cytoplasm of the cells and 
the subcellular localization was not different from Smad6 cotransfection with TP R-I(T204D), 
constitutively active TpR-1 . Smad7 was located in the cell nuclei but exported in a large 
proportion of total treated cells from the nuclei to the cytoplasm upon cotransfection with T pR- 
I(T204D) ? which is in agreement with earlier reports (Eppert et al. 1996; Nakao et al., 1997). 

30 The quantitation of nuclear versus cytoplasmic staining for the different Smads in the absence or 
presence of TPR-I(T204D) are presented in Fig. 18B. The nuclear localization was checked by 
DAPI staining. The observed differences in subcellular distribution of Smad7 versus Smad2 and 
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Smad6, suggest a functional significance of TpR-I-mediated export of Smad7 from the nucleus. 
However, over expression in COS cells and use of epitope tag may affect Smad7 subcellular 
distribution. Therefore, the subcellular distribution of inhibitory Smads in nontransfected 
MvlLu cells using antisera that specifically recognize Smad6 and Smad7 was also examined. In 
5 agreement with results in transfected COS cells, a cytoplasmic staining for Smad6 in the absence 
or presence of TGF-pi (or BMP-7) was observed. In some experiments, a TGF-pi -induced 
nuclear translocation was observed in some of the cells (Fig. 18C). In the absence of ligand, a 
nuclear staining was observed for Smad7. However, upon TGF-pi stimulation a cytoplasmic 
accumulation of Smad7 was induced (Fig. 18C). Two different antisera for each of Smad6 
10 (recognizing Smad6L but not Smad6S) and Smad7, raised against different epitopes, gave 
identical results. 



Example 20: Differential localization of Smad7 deletion mutants in the absence or presence 
ofTGF-pl 

15 The subcellular localization of F-Smad6 and F-Smad7 in MvlLu cells stably expressing 

F-Smad7 and F-Smad7 were analyzed in the absence or presence TGF-pi using the Flag 
antibody and immunofluorescence. In Fig. 19, the subcellular distribution of Smad7 mutants in 
the absence or presence of TGF-pl is shown. The cells were pretreated with zinc chloride. 
Smads were localized in the cells using Flag antibody. Identical results were obtained, compared 

20 to the experiments in which transfected COS cells and nontransfected MvlLu cells were used; 
cytoplasmic staining for F-Smad6L and Smad6S versus a TGF-pi -induced nuclear export for F- 
Smad7 (Fig. 19). To gain more insight into regions in Smad7 that are important for nuclear 
localization and TGF-pl -induced nuclear export, the subcellular distribution of different Smad7 
deletion mutants were analyzed. pMEP4 expression constructs for the C-terminal domain of F- 

25 Smad7 (7C; amino acids 204-426). F-Smad7C with C-tail deletion (7CA; amino acids 204-407), 
Smad7 with deletion of C-tail (7A; amino acids 1-407), and N-terminal domain of Smad7 (7N; 
amino acids 1-203) were stably transfected into MvlLu cells, and cell lines characterized for 
Smad7 expression upon zinc chloride treatment (Fig. 15). All cells showed some degree of leaky 
expression, but in all cell lines zinc treatment included expression of the Smad protein. 

30 Subcellular distribution of the 7C mutant (after zinc chloride treatment) in the absence or 

presence of TGF-pi was similar to that of wild-type Smad7, although its efficiency of nuclear 
export was much less than that of wild-type Smad7. An intact MH2 domain appeared important 
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for nuclear localization as 7CA and 7 A mutants were predominantly in cytoplasm in the absence 
of TGF-pi . Smad7N was found in the cytoplasm in the absence or presence of TGF-pl ; it had a 
spotted localization suggesting association with a particular cell structure (Fig. 19). None of the 
Smad7 deletion constructs were able to interfere with TGF-pl -induced growth inhibition as 
5 observed for Smad7 wild-type. 

Example 21: Characterization of the Mouse Smad7 Promoter 

The mouse Smad7 promoter was isolated from a commercially available genomic DNA 
10 library of the 129/Sv mouse strain (Stratagene, La Jolla, CA), using a ~800bp Smad7 cDNA 
fragment as a probe (nucleotides 883-1614 of SEQ ID NO. 3). 

The putative Smad7 promoter that was isolated comprised of a ~4,465bp Xho I-Xho I 
fragment from the 5'-end of the Smad7 mouse genomic DNA. A single Bam HI site was 
identified by restriction mapping ~3,840bp from the 5 -Xho I site. In order to determine the 
15 promoter properties of the ~4,465bp Xho I-Xho I DNA, three different regions of this DNA 
fragment (i.e., ~4,465bp Xho I-Xho I, ~725bp Bam Hl-Xho I, and ~3,840bp Xho I-Bam HI) 
were inserted 5 ! of the promoter-less luciferase reporter gene in the commercially available 
pGL3-Basic expression vector (Promega, Madison, WI) and transfected into HepG2 cells. 
Following transfection, the cells were starved overnight in DMEM (0.3% FBS) and later 
20 stimulated with 10 ng/ml of TGF-pl or vehicle (negative control) for 16 hours. The cells were 
lysed and subjected to luciferase analysis. These results, depicted in Figure 21, show that both 
the ~4,465bp Xho I-Xho I and the ~725bp Bam Hl-Xho I Smad7 DNA fragments but not the 
~3,840bp Xho I-Bam HI fragment, confer TGF-pi inducibility to the heterologous luciferase 
transcription unit. 

25 The 725bp Bam Hl-Xho I Smad7 promoter fragment was further characterized. Its 

sequence is depicted as SEQ ID NO. 15 of the sequence listing. The transcriptional start site was 
localized by an RNAase Pretection Assay and was localized at — nucleotide 499 of SEQ ID NO. 
15. This start site was also found to coincide with the predicted start site as determined by 
computer software programs searching for such sites (Wingender, E. et al., Biotechnology, 1994, 

30 35:273-280; Prestidge, D. et al., J Mol Biol, 1995, 249:923-932). Figure 22 depicts several 
restriction enzyme sites and putative binding sites for different transcription factors and Smad 
proteins in the Bam Hl-Xho I promoter fragment. The putative binding sites for non-Smad 
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transcription factors were identified using the well known in the art TF SEARCH computer 
program and the TRANSFAC MATRIX TABLE database. Putative Smad binding regions were 
assessed using sequence information from the literature. 

All references disclosed herein are incorporated by reference in their entirety. 
5 A Sequence Listing is presented followed by what is claimed: 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION 



(i) APPLICANT: 

(A) NAME: LUDWIG INSTITUTE FOR CANCER RESEARCH 

(B) STREET: 1345 AVENUE OF THE AMERICAS 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 10105 



(ii) TITLE OF THE INVENTION: SMAD7 AND USES THEREOF 



(iii) NUMBER OF SEQUENCES: 15 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Wolf, Greenfield & Sacks, P.C. 

(B) STREET: 600 Atlantic Avenue 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02210-2211 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 20-MAY-1998 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/047,221 

(B) FILING DATE: 20-MAY-1997 

(A) APPLICATION NUMBER: 60/060,465 

(B) FILING DATE: 30-SEP-1997 

(A) APPLICATION NUMBER: 60/075,940 

(B) FILING DATE: 25-FEB-1998 

(A) APPLICATION NUMBER: 60/077,033 

(B) FILING DATE: 06-MAR-1998 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Van Amsterdam, John R. 

(B) REGISTRATION NUMBER: 4 0,212 

(C) REFERENCE/DOCKET NUMBER: L04 61/7032 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-720-3500 

(B) TELEFAX: 617-720-2441 
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(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CAGCAACCAT GGACGGGTTT CACCGTGCAG ATCAGCTTTG TGAAGGGCTG GGGCCAGTGC 6 0 

TACACCCGCC AGTTCATCAG CAGCTGCCCG TGCTGGCTGG AGGTCATCTT CAACAGCCGG 12 0 

TAGTCGGTCG TGTGGTGGGG AGAAGAGGAC AGGGCGGATC GTGAGCCGAG CAGGCCACCG 18 0 

TTCAAACTAC TTGCTGCTAA CCTTTCCCGA GTGATTGCTT TTCATGCAAA CTCTTTGGTT 24 0 

GGTGTTGTTA TTGCCATTCA TTGTTGGTTT TGTTTTGTTC TGTTCTGGGT TCAGCCTGGC 3 00 

CTGCCCAGCC CTGGTCATCC AGCTGTACTG GCCCCTGGGG GGGTTCTGAG CAGTGCCTCT 360 

TGTCTTGGAG AC AG AC C TGG TGCCTGGCGC TGCCTTCAGC AGGCAGCAGG CAGCCTCCTG 42 0 

CACACTGGCT TTTTTAGTCA TTTATGGGCA AAAAGAGTTA AAGTAAAACT TTGCAAATCC 4 80 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ACTCTAGTTC ACAGAGTCGA CTAAGGTGAT GGGGGTTGCA GCACACCAGC TCGGGGTTGA 6 0 

TCTTCCCGTA AGATTCACAG CAACACAGCC TCTTGACTTC CGAGGAATGC CTGAGATCCG 12 0 

GCCACCTGAA CACTTTGCAC AGCAGGAGGG GGAGCGAGTA GGACGAGGGC GGCTGCACTG 18 0 

GCTGCCGCTG GCGGGCACGC CCGGGCCCAG CCTGCAGTCC AGGCGGCCGG GCAGCAGGAG 24 0 

GCACGCGGTG CGCGTACCGC CGCGGGACTC CACGGCCTGA AGCAGCAGCT CCAGC 2 95 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 308... 1585 
(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CGTTCCTCCA GGGTCACGCC GGGGCCCGAA AGCCGCGCAG GGCGCGGGCC GCGCCGGGTG 6 0 

GGGCATCCGA AGCGCAGCCC CCCGATCCCC GGCAGGCGCC CTGGGCCCCC GCGCGCGCCC 12 0 

CGGCCTCTGG GAGATGGCGC ATGCCAGGAG GGCCCCTCCG GCCGCCGCCG GTTCTGCCCG 18 0 

GGCCCCTGCT GTTGCTGCTG TCGCCTGCGC CTGNTGCCCC AAGTCGGCGC CCGATTNTTC 24 0 

ATGGTGTGCG GAGGTCATGT TCGCTCCTTA GCCGGCAAAC GATTTTCTCC TCGCCTCCTC 3 00 

GCCCCGC ATG TTC AGG ACC AAA CGA TCT GCG CTC GTC CGG CGT CTC TGG 34 9 
Met Phe Arg Thr Lys Arg Ser Ala Leu Val Arg Arg Leu Trp 
15 10 

AGG AGC CGT GCG CCC GGC GGC GAG GAC GAG GAG GAG GGC GTG GGG GGT 3 97 

Arg Ser Arg Ala Pro Gly Gly Glu Asp Glu Glu Glu Gly Val Gly Gly 
15 20 25 30 

GGC GGC GGA GGA GGC GAG CTG CGG GGA GAA GGG GCG ACG GAC GGC CGG 44 5 

Gly Gly Gly Gly Gly Glu Leu Arg Gly Glu Gly Ala Thr Asp Gly Arg 
35 40 45 

GCT TAT GGG GCT GGT GGC GGC GGT GCG GGC AGG GCT GGC TGC TGC CTG 4 93 

Ala Tyr Gly Ala Gly Gly Gly Gly Ala Gly Arg Ala Gly Cys Cys Leu 
50 55 60 

GGC AAG GCA GTC CGA GGT GCC AAA GGT CAC CAC CAT CCC CAT CCC CCA 541 
Gly Lys Ala Val Arg Gly Ala Lys Gly His His His Pro His Pro Pro 
65 70 75 

ACC TCG GGT GCC GGG GCG GCC GGG GGC GCC GAG GCG GAT CTG AAG GCG 589 
Thr Ser Gly Ala Gly Ala Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala 
80 85 90 

CTC ACG CAC TCG GTG CTC AAG AAA CTC AAG GAG CGG CAG CTG GAG CTG 63 7 

Leu Thr His Ser Val Leu Lys Lys Leu Lys Glu Arg Gin Leu Glu Leu 
95 100 105 110 

CTG CTT CAG GCC GTG GAG TCC CGC GGC GGT ACG CGC ACC GCG TGC CTC 685 
Leu Leu Gin Ala Val Glu Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu 
115 120 125 

CTG CTG CCC GGC CGC CTG GAC TGC AGG CTG GGC CCG GGG GCG CCC GCC 73 3 

Leu Leu Pro Gly Arg Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala 
130 135 140 

AGC GCG CAG CCC GCG CAG CCG CCC TCG TCC TAC TCG CTC CCC CTC CTG 781 
Ser Ala Gin Pro Ala Gin Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu 
145 150 155 

CTG TGC AAA GTG TTC AGG TGG CCG GAT CTC AGG CAT TCC TCG GAA GTC 82 9 

Leu Cys Lys Val Phe Arg Trp Pro Asp Leu Arg His Ser Ser Glu Val 
160 165 170 

AAG AGG CTG TGT TGC TGT GAA TCT TAC GGG AAG ATC AAC CCC GAG CTG 877 
Lys Arg Leu Cys Cys Cys Glu Ser Tyr Gly Lys lie Asn Pro Glu Leu 
175 180 185 190 

GTG TGC TGC AAC CCC CAT CAC CTT AGT CGA CTC TGT GAA CTA GAG TCT 92 5 
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Val Cys Cys Asn Pro His His Leu Ser Arg Leu Cys Glu Leu Glu Ser 

195 200 205 

CCC CCT CCT CCT TAC TCC AGA TAC CCA ATG GAT TTT CTC AAA CCA ACT 973 

Pro Pro Pro Pro Tyr Ser Arg Tyr Pro Met Asp Phe Leu Lys Pro Thr 

210 215 220 

GCA GGC TGT CCA GAT GCT GTA CCT TCC TCC GCG GAA ACC GGG GGA ACG 1021 

Ala Gly Cys Pro Asp Ala Val Pro Ser Ser Ala Glu Thr Gly Gly Thr 
225 230 235 

AAT TAT CTG GCC CCT GGG GGG CTT TCA GAT TCC CAA CTT CTT CTG GAG 106 9 

Asn Tyr Leu Ala Pro Gly Gly Leu Ser Asp Ser Gin Leu Leu Leu Glu 
240 245 250 

CCT GGG GAT CGG TCA CAC TGG TGC GTG GTG GCA TAC TGG GAG GAG AAG 1117 

Pro Gly Asp Arg Ser His Trp Cys Val Val Ala Tyr Trp Glu Glu Lys 

255 260 265 270 

ACT CGC GTG GGG AGG CTC TAC TGT GTC CAA GAG CCC TCC CTG GAT ATC 1165 

Thr Arg Val Gly Arg Leu Tyr Cys Val Gin Glu Pro Ser Leu Asp lie 

275 280 285 

TTC TAT GAT CTA CCT CAG GGG AAT GGC TTT TGC CTC GGA CAG CTC AAT 1213 

Phe Tyr Asp Leu Pro Gin Gly Asn Gly Phe Cys Leu Gly Gin Leu Asn 

290 295 300 

TCG GAC AAC AAG AGT CAG CTG GTA CAG AAA GTG CGG AGC AAG ATC GGC 12 61 

Ser Asp Asn Lys Ser Gin Leu Val Gin Lys Val Arg Ser Lys lie Gly 
305 310 315 

TGT GGC ATC CAG CTG ACG CGG GAA GTG GAT GGC GTG TGG GTT TAC AAC 13 0 9 

Cys Gly lie Gin Leu Thr Arg Glu Val Asp Gly Val Trp Val Tyr Asn 
320 325 330 

CGC AGC AGT TAC CCC ATC TTC ATC AAG TCC GCC ACA CTG GAC AAC CCG 13 57 

Arg Ser Ser Tyr Pro lie Phe lie Lys Ser Ala Thr Leu Asp Asn Pro 

335 340 345 350 

GAC TCC AGG ACG CTG TTG GTG CAC AAA GTG TTC CCT GGT TTC TCC ATC 14 05 

Asp Ser Arg Thr Leu Leu Val His Lys Val Phe Pro Gly Phe Ser lie 

355 360 365 

AAG GCT TTT GAC TAT GAG AAA GCC TAC AGC CTG CAG CGG CCC AAT GAC 14 53 

Lys Ala Phe Asp Tyr Glu Lys Ala Tyr Ser Leu Gin Arg Pro Asn Asp 

370 375 380 

CAC GAG TTC ATG CAG CAA CCA TGG ACG GGT TTC ACC GTG CAG ATC AGC 1501 

His Glu Phe Met Gin Gin Pro Trp Thr Gly Phe Thr Val Gin He Ser 
385 390 395 

TTT GTG AAG GGC TGG GGC CAG TGC TAC ACC CGC CAG TTC ATC AGC AGC 154 9 

Phe Val Lys Gly Trp Gly Gin Cys Tyr Thr Arg Gin Phe He Ser Ser 
400 405 410 

TGC CCG TGC TGG CTG GAG GTC ATC TTC AAC AGC CGG TAGTCGGTCG TGTGGT 16 01 
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Cys Pro Cys Trp Leu Glu Val lie Phe Asn Ser Arg 
415 420 425 

GGGGAGAAGA GGACAGGGCG GATCGTGAGC CGAGCAGGCC ACCGTTCAAA CTACTTGCTG 1661 

CTAACCTTTC CCGAGTGATT GCTTTTCATG CAAACTCTTT GGTTGGTGTT GTTATTGCCA 1721 

TTCATTGTTG GTTTTGTTTT GTTCTGTTCT GGGTTCAGCC TGGCCTGCCC AGCCCTGGTC 1781 

ATCCAGCTGT ACTGGCCCCT GGGGGGGTTC TGAGCAGTGC CTCTTGTCTT GGAGACAGAC 1841 

CTGGTGCCTG GCGCTGCCTT CAGCAGGCAG CAGGCAGCCT CCTGCACACT GGCTTTTTTA 19 01 
GTCATTTATG GGCAAAAAGA GTTAAAGTAA AACTTTGCAA ATCC 194 5 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
(v) FRAGMENT TYPE : internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met 


Phe 


Arcr 


Thr 


Lys 


Arcr 


Ser 


Ala 


Leu 


Val 


Arcr 


Arcr 


Leu 


Trp 


Arg Ser 


1 








5 










10 










15 


Arg 


Ala 


Pro 


Gly 


Gly 


Glu 


Asp 


Glu 


Glu 


Glu 


Gly 


Val 


Gly Gly Gly Gly 








20 










25 










30 




Gly 


Gly 


Gly 


Glu 


Leu 


Arg 


Gly 


Glu 


Gly 


Ala 


Thr 


Asp 


Gly Arg Ala Tyr 






35 










40 










45 






Gly 


Ala 


Gly 


Gly 


Gly 


Gly 


Ala 


Gly 


Arg 


Ala 


Gly 


Cys 


Cys 


Leu 


Gly Lys 




50 










55 










60 








Ala 


Val 


Arg 


Gly 


Ala 


Lys 


Gly 


His 


His 


His 


Pro 


His 


Pro 


Pro 


Thr Ser 


65 










70 










75 








80 


Gly 


Ala 


Gly 


Ala 


Ala 


Gly 


Gly 


Ala 


Glu 


Ala 


Asp 


Leu 


Lys 


Ala 


Leu Thr 










85 










90 










95 


His 


Ser 


Val 


Leu 


Lys 


Lys 


Leu 


Lys 


Glu 


Arg 


Gin 


Leu 


Glu 


Leu 


Leu Leu 








100 










105 










110 




Gin 


Ala 


Val 


Glu 


Ser 


Arg 


Gly 


Gly 


Thr 


Arg 


Thr 


Ala 


Cys 


Leu 


Leu Leu 






115 










120 










125 






Pro 


Gly 


Arg 


Leu 


Asp 


Cys 


Arg 


Leu 


Gly 


Pro 


Gly 


Ala 


Pro 


Ala 


Ser Ala 




130 










135 










140 








Gin 


Pro 


Ala 


Gin 


Pro 


Pro 


Ser 


Ser 


Tyr 


Ser 


Leu 


Pro 


Leu 


Leu 


Leu Cys 


145 










150 










155 








160 


Lys 


Val 


Phe 


Arg 


Trp 


Pro 


Asp 


Leu 


Arg 


His 


Ser 


Ser 


Glu 


Val 


Lys Arg 










165 










170 










175 


Leu 


Cys 


Cys 


Cys 


Glu 


Ser 


Tyr 


Gly 


Lys 


He 


Asn 


Pro 


Glu 


Leu 


Val Cys 








180 










185 










190 




Cys 


Asn 


Pro 


His 


His 


Leu 


Ser 


Arg 


Leu 


Cys 


Glu 


Leu 


Glu 


Ser 


Pro Pro 






195 










200 










205 






Pro 


Pro 


Tyr 


Ser 


Arg 


Tyr 


Pro 


Met 


Asp 


Phe 


Leu 


Lys 


Pro 


Thr 


Ala Gly 




210 










215 










220 








Cys 


Pro 


Asp 


Ala 


Val 


Pro 


Ser 


Ser 


Ala 


Glu 


Thr 


Gly 


Gly Thr Asn Tyr 


225 










230 










235 








240 


Leu 


Ala 


Pro 


Gly 


Gly 


Leu 


Ser 


Asp 


Ser 


Gin 


Leu 


Leu 


Leu 


Glu 


Pro Gly 










245 










250 










255 


Asp 


Arg 


Ser 


His 


Trp 


Cys 


Val 


Val 


Ala 


Tyr 


Trp 


Glu 


Glu 


Lys 


Thr Arg 



WO 98/53068 



-68- 



PCT/US98/10305 









260 










265 










270 






Val 


Gly 


Arg 
275 


Leu 


Tyr 


Cys 


Val 


Gin 
280 


Glu 


Pro 


Ser 


Leu 


Asp 
2 85 


He 


Phe 


Tyr 


Asp 


Leu 


Pro 


Gin 


Gly 


Asn 


Gly 


Phe 


Cys 


Leu 


Gly Gin 


Leu 


Asn 


Ser 


Asp 




290 










295 










3 00 










Asn 


Lys 


Ser 


Gin 


Leu 


Val 


Gin 


Lys 


Val 


Arg 


Ser 


Lys 


He 


Gly Cys 


Gly 


305 










310 










315 










320 


He 


Gin 


Leu 


Thr 


Arg 
325 


Glu 


Val 


Asp 


Gly 


Val 
330 


Trp 


Val 


Tyr 


Asn 


Arg 
335 


Ser 


Ser 


Tyr 


Pro 


He 
340 


Phe 


He 


Lys 


Ser 


Ala 
345 


Thr 


Leu 


Asp 


Asn 


Pro 
350 


Asp 


Ser 


Arg 


Thr 


Leu 
355 


Leu 


Val 


His 


Lys 


Val 
360 


Phe 


Pro 


Gly 


Phe 


Ser 
365 


He 


Lys 


Ala 


Phe 


Asp 
370 


Tyr 


Glu 


Lys 


Ala 


Tyr 
375 


Ser 


Leu 


Gin 


Arg 


Pro 
380 


Asn 


Asp 


His 


Glu 


Phe 


Met 


Gin 


Gin 


Pro 


Trp 


Thr 


Gly 


Phe 


Thr 


Val 


Gin 


He 


Ser 


Phe 


Val 


385 










390 










395 










400 


Lys 


Gly 


Trp 


Gly 


Gin 
405 


Cys 


Tyr 


Thr 


Arg 


Gin 
410 


Phe 


He 


Ser 


Ser 


Cys 
415 


Pro 


Cys 


Trp 


Leu 


Glu 
420 


Val 


He 


Phe 


Asn 


Ser 
425 


Arg 















(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 50 . . . 1327 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GAATTCGGCA CGAGGGCAAA CGACTTTTCT CCTCGCCTCC TCGCCCCGC ATG TTC AGG 5 8 

Met Phe Arg 

1 

ACC AAA CGA TCT GCG CTC GTC CGG CGT CTC TGG AGG AGC CGT GCG CCC 106 
Thr Lys Arg Ser Ala Leu Val Arg Arg Leu Trp Arg Ser Arg Ala Pro 
5 10 15 

GGC GGC GAG GAC GAG GAG GAG GGC GCA GGG GGA GGT GGA GGA GGA GGC 154 
Gly Gly Glu Asp Glu Glu Glu Gly Ala Gly Gly Gly Gly Gly Gly Gly 
20 25 30 35 

GAG CTG CGG GGA GAA GGG GCG ACG GAC AGC CGA GCG CAT GGG GCC GGT 2 02 

Glu Leu Arg Gly Glu Gly Ala Thr Asp Ser Arg Ala His Gly Ala Gly 
40 45 50 



GGC GGC GGC CCG GGC AGG GCT GGA TGC TGC CTG GGC AAG GCG GTG CGA 



250 
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Gly Gly Gly Pro Gly Arg Ala Gly Cys Cys Leu Gly Lys Ala Val Arg 
55 60 65 

GGT GCC AAA TGT CAC CAC CAT CCC CAC CCG CCA GCC GCG GGC GCC GGC 2 98 

Gly Ala Lys Cys His His His Pro His Pro Pro Ala Ala Gly Ala Gly 
70 75 80 

GCG GCC GGG GGC GCC GAG GCG GAT CTG AAG GCG CTC ACG CAC TCG GTG 34 6 

Ala Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr His Ser Val 
85 90 95 

CTC AAG AAA CTG AAG GAG CGG CAG CTG GAG CTG CTG CTC CAG GCC GTG 3 94 

Leu Lys Lys Leu Lys Glu Arg Gin Leu Glu Leu Leu Leu Gin Ala Val 
100 105 110 115 

GAG TCC CGC GGC GGG ACG CGC ACC GCG TGC CTC CTG CTG CCC GGC CGC 442 
Glu Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu Pro Gly Arg 
120 125 130 

CTG GAC TGC AGG CTG GGC CCG GGG GCG CCC GCC GGC GCG CAG CCT GCG 490 
Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala Gin Pro Ala 
135 140 145 

CAG CCG CCC TCG TCC TAC TCG CTC CCC CTC CTG CTG TGC AAA GTG TTC 53 8 

Gin Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu Leu Cys Lys Val Phe 
150 155 160 

AGG TGG CCG GAT CTC AGG CAT TCC TCG GAA GTC AAG AGG CTG TGT TGC 58 6 

Arg Trp Pro Asp Leu Arg His Ser Ser Glu Val Lys Arg Leu Cys Cys 
165 170 175 

TGT GAA TCT TAC GGG AAG ATC AAC CCC GAG CTG GTG TGC TGC AAC CCC 634 
Cys Glu Ser Tyr Gly Lys lie Asn Pro Glu Leu Val Cys Cys Asn Pro 
180 185 190 195 

CAT CAC CTT AGC CGA CTC TGC GAA CTA GAG TCT CCC CCC CCT CCT TAC 6 82 

His His Leu Ser Arg Leu Cys Glu Leu Glu Ser Pro Pro Pro Pro Tyr 
200 205 210 

TCC AGA TAC CCG ATG GAT TTT CTC AAA CCA ACT GCA GAC TGT CCA GAT 73 0 

Ser Arg Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp Cys Pro Asp 
215 220 225 

GCT GTG CCT TCC TCC GCT GAA ACA GGG GGA ACG AAT TAT CTG GCC CCT 77 8 

Ala Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr Leu Ala Pro 
230 235 240 

GGG GGG CTT TCA GAT TCC CAA CTT CTT CTG GAG CCT GGG GAT CGG TCA 82 6 

Gly Gly Leu Ser Asp Ser Gin Leu Leu Leu Glu Pro Gly Asp Arg Ser 
245 250 255 

CAC TGG TGC GTG GTG GCA TAC TGG GAG GAG AAG ACG AGA GTG GGG AGG 874 
His Trp Cys Val Val Ala Tyr Trp Glu Glu Lys Thr Arg Val Gly Arg 
260 265 270 275 

CTC TAC TGT GTC CAG GAG CCC TCT CTG GAT ATC TTC TAT GAT CTA CCT 922 
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Leu Tyr Cys Val Gin Glu Pro Ser Leu Asp lie Phe Tyr Asp Leu Pro 
280 285 290 



CAG GGG AAT GGC TTT TGC CTC GGA CAG CTC AAT TCG GAC AAC AAG AGT 9 70 

Gin Gly Asn Gly Phe Cys Leu Gly Gin Leu Asn Ser Asp Asn Lys Ser 
295 300 305 



CAG CTG GTG CAG AAG GTG CGG AGC AAA ATC GGC TGC GGC ATC CAG CTG 1018 
Gin Leu Val Gin Lys Val Arg Ser Lys lie Gly Cys Gly lie Gin Leu 
310 315 320 



ACG CGG GAG GTG GAT GGT GTG TGG GTG TAC AAC CGC AGC AGT TAC CCC 1066 

Thr Arg Glu Val Asp Gly Val Trp Val Tyr Asn Arg Ser Ser Tyr Pro 
325 330 335 

ATC TTC ATC AAG TCC GCC ACA CTG GAC AAC CCG GAC TCC AGG ACG CTG 1114 

lie Phe lie Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser Arg Thr Leu 
340 345 350 355 



TTG GTA CAC AAG GTG TTC CCC GGT TTC TCC ATC AAG GCT TTC GAC TAC 1162 
Leu Val His Lys Val Phe Pro Gly Phe Ser lie Lys Ala Phe Asp Tyr 
360 365 370 



GAG AAG GCG TAC AGC CTG CAG CGG CCC AAT GAC CAC GAG TTT ATG CAG 1210 
Glu Lys Ala Tyr Ser Leu Gin Arg Pro Asn Asp His Glu Phe Met Gin 
375 380 385 



CAG CCG TGG ACG GGC TTT ACC GTG CAG ATC AGC TTT GTG AAG GGC TGG 12 5 8 

Gin Pro Trp Thr Gly Phe Thr Val Gin lie Ser Phe Val Lys Gly Trp 
390 395 400 



GGC CAG TGC TAC ACC CGC CAG TTC ATC AGC AGC TGC CCG TGC TGG CTA 13 06 

Gly Gin Cys Tyr Thr Arg Gin Phe lie Ser Ser Cys Pro Cys Trp Leu 
405 410 415 



GAG GTC ATC TTC AAC AGC CGG TAGCCGCGTG CGGAGGGGAC AGAGCGTGAG CTGA 13 61 
Glu Val lie Phe Asn Ser Arg 
420 425 



GCAGGCCACA CTTCAAACTA CTTTGCTGCT AATATTTTCC TCCTGAGTGC TTGCTTTTCA 1421 

TGCAAACTCT TTGGTCGTTT TTTTTTTGTT TGTTGGTTGG TTTTCTTCTT CTCGTCCTCG 14 81 

TTTGTGTTCT ATTTTTCTAA CTACAAAGGT TTAAATGAAC AAGAGAAGCA TCTCTCATTG 1541 

GAAATTTAGC ATTGTAGTGC TTTGAGAGAG AAAGGACTCC CTGNAAAAAA ACCTGAGATT 16 01 

TATTAAAGNA AAAAATGTAT TTTATGTTAT ATATAAATAT ATTATTACTT GTAAATATAA 1661 

AGACGTTTTA TAAGCATCAT TATTTATGTA TTGTGCAATG TGTATAAACN AGNAAAATAA 1721 

AGAAAAGATG CACTTTGCTT TAATATAAAT GCAAATAACA AATGCCAAAT TAAAAAAGAT 1781 

AAACACAAGA TTGGTGTTTT TTTCTATGGG TGTTATCACC TAGCNGAATG TTTTTCTAAA 1841 

GGAGTTTATG TTCCATTAAA CGATTTTTAA AANGT 18 76 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Phe Arg Thr Lys Arg Ser Ala Leu Val Arg Arg Leu Trp Arg Ser 

15 10 15 

Arg Ala Pro Gly Gly Glu Asp Glu Glu Glu Gly Ala Gly Gly Gly Gly 

20 25 30 

Gly Gly Gly Glu Leu Arg Gly Glu Gly Ala Thr Asp Ser Arg Ala His 

35 40 45 

Gly Ala Gly Gly Gly Gly Pro Gly Arg Ala Gly Cys Cys Leu Gly Lys 

50 55 60 

Ala Val Arg Gly Ala Lys Cys His His His Pro His Pro Pro Ala Ala 
65 70 75 80 

Gly Ala Gly Ala Ala Gly Gly Ala Glu Ala Asp Leu Lys Ala Leu Thr 

85 90 95 

His Ser Val Leu Lys Lys Leu Lys Glu Arg Gin Leu Glu Leu Leu Leu 

100 105 110 

Gin Ala Val Glu Ser Arg Gly Gly Thr Arg Thr Ala Cys Leu Leu Leu 

115 120 125 

Pro Gly Arg Leu Asp Cys Arg Leu Gly Pro Gly Ala Pro Ala Gly Ala 

130 135 140 

Gin Pro Ala Gin Pro Pro Ser Ser Tyr Ser Leu Pro Leu Leu Leu Cys 
145 150 155 160 

Lys Val Phe Arg Trp Pro Asp Leu Arg His Ser Ser Glu Val Lys Arg 

165 170 175 

Leu Cys Cys Cys Glu Ser Tyr Gly Lys lie Asn Pro Glu Leu Val Cys 

180 185 190 

Cys Asn Pro His His Leu Ser Arg Leu Cys Glu Leu Glu Ser Pro Pro 

195 200 205 

Pro Pro Tyr Ser Arg Tyr Pro Met Asp Phe Leu Lys Pro Thr Ala Asp 

210 215 220 

Cys Pro Asp Ala Val Pro Ser Ser Ala Glu Thr Gly Gly Thr Asn Tyr 
225 230 235 240 

Leu Ala Pro Gly Gly Leu Ser Asp Ser Gin Leu Leu Leu Glu Pro Gly 

245 250 255 

Asp Arg Ser His Trp Cys Val Val Ala Tyr Trp Glu Glu Lys Thr Arg 

260 265 270 

Val Gly Arg Leu Tyr Cys Val Gin Glu Pro Ser Leu Asp lie Phe Tyr 

275 280 285 

Asp Leu Pro Gin Gly Asn Gly Phe Cys Leu Gly Gin Leu Asn Ser Asp 

290 295 300 

Asn Lys Ser Gin Leu Val Gin Lys Val Arg Ser Lys lie Gly Cys Gly 
305 310 315 320 

lie Gin Leu Thr Arg Glu Val Asp Gly Val Trp Val Tyr Asn Arg Ser 

325 330 335 

Ser Tyr Pro lie Phe lie Lys Ser Ala Thr Leu Asp Asn Pro Asp Ser 

340 345 350 

Arg Thr Leu Leu Val His Lys Val Phe Pro Gly Phe Ser lie Lys Ala 

355 360 365 

Phe Asp Tyr Glu Lys Ala Tyr Ser Leu Gin Arg Pro Asn Asp His Glu 

370 375 380 

Phe Met Gin Gin Pro Trp Thr Gly Phe Thr Val Gin lie Ser Phe Val 
385 390 395 400 

Lys Gly Trp Gly Gin Cys Tyr Thr Arg Gin Phe lie Ser Ser Cys Pro 
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405 410 415 

Cys Trp Leu Glu Val lie Phe Asn Ser Arg 
420 425 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATGTTCAGGA CCAAACGATC TGCGCTCGTC CGGCGTCTCT GGAGGAGCCG TGCGCCCGGC 6 0 

GGCGAGGACG AGGAGGAGGG CGTGGGGGGT GGCGGCGGAG GAGGCGAGCT GCGGGGAGAA 12 0 

GGGGCGACGG ACGGCCGGGC TTATGGGGCT GGTGGCGGCG GTGCGGGCAG GGCTGGCTGC 18 0 

TGCCTGGGCA AGGCAGTCCG AGGTGCCAAA GGTCACCACC ATCCCCATCC CCCAACCTCG 24 0 

GGTGCCGGGG CGGCCGGGGG CGCCGAGGCG GATCTGAAGG CGCTCACGCA CTCGGTGCTC 3 00 

AAGAAACTCA AGGAGCGGCA GCTGGAGCTG CTGCTTCAGG CCGTGGAGTC CCGCGGCGGT 36 0 

ACGCGCACCG CGTGCCTCCT GCTGCCCGGC CGCCTGGACT GCAGGCTGGG CCCGGGGGCG 42 0 

CCCGCCAGCG CGCAGCCCGC GCAGCCGCCC TCGTCCTACT CGCTCCCCCT CCTGCTGTGC 4 80 

AAAGTGTTCA GGTGGCCGGA TCTCAGGCAT TCCTCGGAAG TCAAGAGGCT GTGTTGCTGT 54 0 

GAATCTTACG GGAAGATCAA CCCCGAGCTG GTGTGCTGCA ACCCCCATCA CCTTAGTCGA 600 

CTCTGTGAAC TAGAGTCTCC CCCTCCTCCT TACTCCAGAT AC C C AATGG A TTTTCTCAAA 6 60 

CCAACTGCAG GCTGTCCAGA TGCTGTACCT TCCTCCGCGG AAACCGGGGG AACGAATTAT 72 0 

CTGGCCCCTG GGGGGCTTTC AGATTCCCAA CTTCTTCTGG AGCCTGGGGA TCGGTCACAC 78 0 

TGGTGCGTGG TGGCATACTG GGAGGAGAAG ACTCGCGTGG GGAGGCTCTA CTGTGTCCAA 84 0 

GAGCCCTCCC TGGATATCTT CTATGATCTA CCTCAGGGGA ATGGCTTTTG CCTCGGACAG 900 

CTCAATTCGG ACAACAAGAG TCAGCTGGTA CAGAAAGTGC GGAGCAAGAT CGGCTGTGGC 96 0 

ATCCAGCTGA CGCGGGAAGT GGATGGCGTG TGGGTTTACA ACCGCAGCAG TTACCCCATC 102 0 

TTCATCAAGT CCGCCACACT GGACAACCCG GACTCCAGGA CGCTGTTGGT GCACAAAGTG 1080 

TTCCCTGGTT TCTCCATCAA GGCTTTTGAC TATGAGAAAG CCTACAGCCT GCAGCGGCCC 114 0 

AATGACCACG AGTTCATGCA GCAACCATGG ACGGGTTTCA CCGTGCAGAT CAGCTTTGTG 12 00 

AAGGGCTGGG GCCAGTGCTA CACCCGCCAG TTCATCAGCA GCTGCCCGTG CTGGCTGGAG 12 6 0 

GTCATCTTCA ACAGCCGGTA G 12 81 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

ATGTTCAGGA CCAAACGATC TGCGCTCGTC CGGCGTCTCT GGAGGAGCCG TGCGCCCGGC 6 0 

GGCGAGGACG AGGAGGAGGG CGCAGGGGGA GGTGGAGGAG GAGGCGAGCT GCGGGGAGAA 12 0 

GGGGCGACGG ACAGCCGAGC GCATGGGGCC GGTGGCGGCG GCCCGGGCAG GGCTGGATGC 180 

TGCCTGGGCA AGGCGGTGCG AGGTGCCAAA TGTCACCACC ATCCCCACCC GCCAGCCGCG 24 0 

GGCGCCGGCG CGGCCGGGGG CGCCGAGGCG GATCTGAAGG CGCTCACGCA CTCGGTGCTC 3 00 
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AAGAAACTGA AGGAGCGGCA GCTGGAGCTG CTGCTCCAGG CCGTGGAGTC CCGCGGCGGG 36 0 

ACGCGCACCG CGTGCCTCCT GCTGCCCGGC CGCCTGGACT GCAGGCTGGG CCCGGGGGCG 42 0 

CCCGCCGGCG CGCAGCCTGC GCAGCCGCCC TCGTCCTACT CGCTCCCCCT CCTGCTGTGC 4 80 

AAAGTGTTCA GGTGGCCGGA TCTCAGGCAT TCCTCGGAAG TCAAGAGGCT GTGTTGCTGT 54 0 

GAATCTTACG GGAAGATCAA CCCCGAGCTG GTGTGCTGCA ACCCCCATCA CCTTAGCCGA 600 

CTCTGCGAAC TAGAGTCTCC CCCCCCTCCT TACTCCAGAT AC C CGATGGA TTTTCTCAAA 66 0 

CCAACTGCAG ACTGTCCAGA TGCTGTGCCT TCCTCCGCTG AAACAGGGGG AACGAATTAT 72 0 

CTGGCCCCTG GGGGGCTTTC AGATTCCCAA CTTCTTCTGG AGCCTGGGGA TCGGTCACAC 78 0 

TGGTGCGTGG TGGCATACTG GGAGGAGAAG ACGAGAGTGG GGAGGCTCTA CTGTGTCCAG 84 0 

GAGCCCTCTC TGGATATCTT CTATGATCTA CCTCAGGGGA ATGGCTTTTG CCTCGGACAG 900 

CTCAATTCGG ACAACAAGAG TCAGCTGGTG CAGAAGGTGC GGAGCAAAAT CGGCTGCGGC 96 0 

ATCCAGCTGA CGCGGGAGGT GGATGGTGTG TGGGTGTACA ACCGCAGCAG TTACCCCATC 102 0 

TTCATCAAGT CCGCCACACT GGACAACCCG GACTCCAGGA CGCTGTTGGT ACACAAGGTG 108 0 

TTCCCCGGTT TCTCCATCAA GGCTTTCGAC TACGAGAAGG CGTACAGCCT GCAGCGGCCC 114 0 

AATGACCACG AGTTTATGCA GCAGCCGTGG ACGGGCTTTA CCGTGCAGAT CAGCTTTGTG 12 00 

AAGGGCTGGG GCCAGTGCTA CACCCGCCAG TTCATCAGCA GCTGCCCGTG CTGGCTAGAG 12 6 0 

GTCATCTTCA ACAGCCGGTA G 12 81 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATGTTCAGGT CCAAACGCTC GGGGCTGGTG CGGCGACTTT GGCGAAGTCG TGTGGTCCCC 6 0 

GACCGGGAGG AAGGCGGCAG CGGCGGCGGC GGTGGCGGCG ACGAGGATGG GAGCTTGGGC 12 0 

AGCCGAGCTG AGCCGGCCCC GCGGGCAAGA GAGGGCGGAG GCTGCGGCCG CTCCGAAGTC 180 

CGCCCGGTAG CCCCGCGGCG GCCCCGGGAC GCAGTGGGAC AGCGAGGCGC CCAGGGCGCG 24 0 

GGGAGGCGCC GGCGCGCAGG GGGCCCCCCG AGGCCCATGT CGGAGCCAGG GGCCGGCGCT 300 

GGGAGCTCCC TGCTGGACGT GGCGGAGCCG GGAGGCCCGG GCTGGCTGCC CGAGAGTGAC 3 60 

TGCGAGACGG TGACCTGCTG TCTCTTTTCG GAGCGGGACG CCGCCGGCGC GCCCCGGGAC 42 0 

GCCAGCGACC CCCTGGCCGG GGCGGCCCTG GAGCCGGCGG GCGGCGGGCG GAGTCGCGAA 480 

GCGCGCTCGC GGCTGCTGCT GCTGGAGCAG GAACTCAAAA CCGTCACGTA CTCGCTGCTG 54 0 

AAGCGGCTCA AGGAGCGCTC GCTGGACACG CTGCTGGAGG CGGTGGAGTC CCGCGGCGGC 600 

GTGCCGGGCG GCTGCGTGCT GGTGCCGCGC GCCGACCTCC GCCTGGGCGG CCAGCCCGCG 660 

CCGCCGCAGC TGCTGCTCGG CCGCCTCTTT CGCTGGCCCG ACCTGCAGCA CGCCGTGGAG 72 0 

CTGAAGCCCC TGTGCGGCTG CCACAGCTTC GCCGCCGCCG CCGACGGCCC TACCGTGTGC 78 0 

TGCAACCCCT ACCACTTCAG CCGGCTCTGC GGGCCCGAAT CTCCGCCACC TCCCTACTCT 84 0 

CGGCTGTCTC CTCGCGACGA GTACAAGCCA CTGGATCTGT CCGATTCCAC ATTGTCTTAC 90 0 

ACTGAAACGG AGGCTACCAA CTCCCTCATC ACTGCTCCGG GTGAATTCTC AGACGCCAGC 96 0 

ATGTCTCCGG ACGCCACCAA GCCGAGCCAC TGGTGCAGCG TGGCGTACTG GGAGCACCGG 102 0 

ACGCGCGTGG GCCGCCTCTA TGCGGTGTAC GACCAGGCCG TCAGCATCTT CTACGACCTA 108 0 

CCTCAGGGCA GCGGCTTCTG CCTGGGCCAG CTCAACCTGG AGCAGCGCAG CGAGTCGGTG 1140 

CGGCGAACGC GCAGCAAGAT CGGCTTCGGC ATCCTGCTCA GCAAGGAGCC CGACGGCGTG 12 00 

TGGGCCTACA ACCGCGGCGA GCACCCCATC TTCGTCAACT CCCCGACGCT GGACGCGCCC 12 60 

GGCGGCCGCG CCCTGGTCGT GCGCAAGGTG CCCCCCGGCT ACTCCATCAA GGTGTTCGAC 132 0 

TTCGAGCGCT CGGGCCTGCA GCACGCGCCC GAG C C CGACG CCGCCGACGG CCCCTACGAC 13 8 0 

CCCAACAGCG TCCGCATCAG CTTCGCCAAG GGCTGGGGGC CCTGCTACTC CCGGCAGTTC 144 0 

ATCACCTCCT GCCCCTGCTG GCTGGAGATC CTCCTCAACA ACCCCAGATA G 14 91 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Glu Ser Pro Pro Pro Pro Tyr Ser Arg Leu Ser Pro Arg Asp Glu Tyr 

15 10 15 

Lys Pro Leu Asp 
20 

(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser Arg Gin Phe He Thr Ser Cys Pro Cys Trp Leu Glu He Leu Asn 

15 10 15 

Pro Arg 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Lys Glu Arg Gin Leu Glu Leu Leu Leu Gin Ala Val Glu Ser Arg Gly 

15 10 15 

Gly Thr Arg Thr Ala 
20 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Lys Ala Val Arg Gly Ala Lys Gly His His His Pro His Pro Pro 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTCGAGATCC GCCGCCAGGG CCTTAAAACA AAACGAATGA ATGAAGCGCC CGGGCGAGCT 60 

CTGCTGGTCT AAATCGGGCC ACTAACCGGC TCATTCCGCT CCAAACTCGC CCGCTCCATG 12 0 

CCCCCAGCTC TTCCGATTTC CCCCTGCTCC CGCTGGGTCT CCGCCAGACC CCCGGGGCCG 18 0 

TCTCGGCCTC CCACCTCCCT CCACCGTGGG TACCCCTCTC TAGACCTGGG AGAGGGTGGC 24 0 

AGTAACTGGG AGGGGGGTTG AAATAGCTTT TAGAAACCCG ATCTGTTGTT TGCGAAACAC 3 00 

AATCGCTTTT TTTTTTTTTA AAGCGACAGG GTGTCTAGAC GGCCACGTGA CGAGGCCGGA 360 

GCCGGGCGCG CCACTGCGCA GTGGAACCAG C CGAGC AG AG GGACGGGTGG GGGGGGCGGG 42 0 

AAGGAGGCGG CGGCGGCTGG GGGCGGGGGA GGGAGGGGTA GAGGGGGGAG GGAAGGGGGC 4 80 

GGAGGCGGGA GGCCTTGCGG GAGGCGGCGA GCCCTGGGCA CATTCGCTCG CTGATCGGCG 54 0 

CACAGAGGAT CTTGTCCCCG AGCTGCGCCA GC AG AG CC AG CCGGGCGCCT CGCTCGGTCC 600 

GCTCGCCGCG C CGG AGAGAG CTGCCTGAGA CGCAGCCAGC CAGCCAGCCG GCGCCACGCC 66 0 

GCCGAGCGCT CGGCCCCGGA GTCCCTGAGT GCGGCGCGGC GAGCCCCCAG CGGCGGCAGA 72 0 

AGGACTCGAG ATC 73 3 
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CLAIMS 

1 . An isolated nucleic acid molecule selected from the group consisting of 

(a) nucleic acid molecules which hybridize under stringent conditions to a molecule 
consisting of the nucleic acid sequence set forth in SEQ ID NO: 3 or SEQ ID NO: 5 and which 

5 code for a polypeptide which inhibits TGF-p superfamily signaling, 

(b) nucleic acid molecules that differ from the nucleic acid molecules of (a) in codon 
sequence due to the degeneracy of the genetic code, and 

(c) complements of (a) and (b). 

10 2. The isolated nucleic acid molecule of claim 1 , wherein the isolated nucleic acid molecule 
comprises SEQ ID NO:7 or SEQ ID NO:8. 

3. The isolated nucleic acid molecule of claim 2, wherein the isolated nucleic acid molecule 
consists of SEQ ID NO:7. 

15 

4. The isolated nucleic acid molecule of claim 2, wherein the isolated nucleic acid molecule 
consists of SEQ ID NO:8. 

5. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule 
20 consists of SEQ ID NO:3. 

6. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule 
consists of SEQ ID NO:5. 

25 7. An isolated nucleic acid molecule selected from the group consisting of (a) a unique 

fragment of SEQ ID NO:3 between 12 and 1944 nucleotides in length, (b) a unique fragment of 
SEQ ID NO:5 between 12 and 1875 nucleotides in length, (c) complements of "(a)" and (d) 
complements of "(b)", provided that the nucleic acid molecule excludes sequences consisting of 
SEQ ID NO: 1 and SEQ ID NO:2. 

30 

8. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 14 contiguous nucleotides. 



WO 98/53068 PCT/US98/1 0305 

- 78 - 

9. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 15 contiguous nucleotides. 

10. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
5 consists of at least 16 contiguous nucleotides. 

1 1 . The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 17 contiguous nucleotides. 

10 12. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 18 contiguous nucleotides. 

13. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 20 contiguous nucleotides. 

15 

14. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
consists of at least 22 contiguous nucleotides. 

15. The isolated nucleic acid molecule of claim 7, wherein the isolated nucleic acid molecule 
20 consists of between 12 and 32 contiguous nucleotides. 

16. An expression vector comprising the isolated nucleic acid molecule of claims 1, 2, 3, 4, 5, 
6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 operably linked to a promoter. 

25 17. A host cell transformed or transfected with the expression vector of claim 16. 

17B. A transgenic non-human animal comprising the expression vector of claim 16. 

17B2. The transgenic non-human animal of claim 17B, comprising a conditional Smad7 
30 expression vector. 

17C. A transgenic non-human animal which has reduced expression of Smad7 nucleic 
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acid molecules. 

17D. The transgenic non-human animal of claim 17C, comprising a Smad7 gene 
disrupted by homologous recombination. 

5 

17E. The transgenic non-human animal of claim 17C, comprising a conditional Smad7 
gene disruption. 

17F. The transgenic non-human animal of claim 17C, comprising a trans-acting negative 
10 regulator of Smad7 expression. 

1 8. An isolated polypeptide encoded by the isolated nucleic acid molecule of claim 1, 2, 3, 4, 
5 or 6, wherein the polypeptide has TGF-p superfamily signaling inhibitory activity. 

15 19. The isolated polypeptide of claim 18, wherein the isolated polypeptide consists of a 
molecule comprising the amino acid sequence of SEQ ID NO:4 or SEQ ID NO:6. 

20. The isolated polypeptide of claim 18, wherein the isolated polypeptide consists of a 
functional fragment or variant. 

20 

21 . The isolated polypeptide of claim 20, wherein the isolated polypeptide consists of amino 
acids 204-426 of SEQ ID NO:4 or 6. 

2 IB. The isolated polypeptide of claim 20, wherein the isolated polypeptide consists of amino 
25 acids 2-261 of SEQ ID NO:4 or 6. 

22. An isolated polypeptide which binds selectively a polypeptide encoded by the isolated 
nucleic acid molecule of claim 1, 2, 3, 4, 5 or 6, provided that the isolated polypeptide is not a 
TGFp superfamily type I receptor. 

30 

23. The isolated polypeptide of claim 22, wherein the isolated polypeptide binds to an 
epitope defined by a polypeptide consisting of the sequence of SEQ ID NOs:4 or 6. 
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23B. The isolated polypeptide of claim 22, wherein the polypeptide is an antibody or fragment 
thereof which selectively binds an epitope defined by a polypeptide selected from the group 
consisting of SEQ ID NO:l 1, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14. 

5 24. The isolated polypeptide of claim 22, wherein the isolated polypeptide is an antibody 
fragment selected from the group consisting of a Fab fragment, a F(ab) 2 fragment or a fragment 
including a CDR3 region selective for a Smad7 polypeptide. 

24B. The isolated polypeptide of claim 22, wherein the isolated polypeptide is a monoclonal 
10 antibody, a humanized antibody or a chimeric antibody. 

25. An isolated complex of polypeptides comprising: 

a polypeptide as claimed in claim 1 8 bound to a receptor selected from the group 
consisting of activated TGFp superfamily type I receptors and complexes of TGFp superfamily 
15 type I receptors and TGFp superfamily type II receptors. 

25B. The isolated complex of claim 25, wherein the receptor is selected from the group 
consisting of TpRI, BMPR-IA, BMPR-IB, ActR-IA, a complex of TpRI and TpRII, a complex 
of BMPR-IA and BMPR-II, a complex of BMPR-IB and BMPR-II, a complex of ActR-IA and 
20 BMPR-II and a complex of ActR-IA and ActR-II. 

26. The isolated complex of polypeptides of claim 25, wherein the polypeptide as claimed in 
claim 1 consists of the amino acid sequence of SEQ ID NO: 6. 

25 27. A method for reducing TGF-P superfamily signal transduction in a mammalian cell, 
comprising 

contacting the mammalian cell with an amount of an agent which increases the amount of 
Smad7 or a fragment thereof in the cell effective to reduce TGF-P superfamily signal 
transduction in the mammalian cell. 

30 

28. The method of claim 27, wherein the TGF-P superfamily signal transduction is signal 
transduction mediated by a TGFp superfamily ligand. 
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28B. The method of claim 28, wherein the TGF0 superfamily ligand is selected from the group 
consisting of TGF-01, activin, Vgl, BMP-4 and BMP-7. 

29. The method of claim 28, wherein the agent is selected from the group consisting of a 
5 nucleic acid encoding the amino acid sequence of SEQ ID NO:4, a nucleic acid encoding the 

amino acid sequence of SEQ ID NO: 6, a nucleic acid encoding amino acids 204-426 of SEQ ID 
NO:4, a nucleic acid encoding amino acids 204-426 of SEQ ID NO:6, a nucleic acid encoding 
amino acids 2-261 of SEQ ID NO:4, a nucleic acid encoding amino acids 2-261 of SEQ ID 
NO:6, and a polypeptide as claimed in claim 18. 

10 

30. The method of claim 28, wherein the agent is selected from the group consisting of 
activin, epidermal growth factor and phorbol esters. 

31. A method for reducing phosphorylation of a pathway specific Smad polypeptide in a 
15 mammalian cell, comprising 

contacting the mammalian cell with an amount of the isolated nucleic acid of claim 1 or 
an isolated Smad polypeptide expression product thereof, effective to reduce phosphorylation of 
the pathway specific Smad polypeptide in the mammalian cell. 

20 3 IB. The method of claim 3 1 , wherein the pathway specific Smad polypeptide is selected from 
the group consisting of Smadl, Smad2, Smad3 and Smad5. 

31C. The method of claim 31, wherein the isolated Smad polypeptide is selected from the 
group consisting of Smad7, SmadS, a N-terminal fragment of Smad7 and a C-terminal fragment 
25 of Smad7. 

32. A method for modulating the proliferation of a cell, the proliferation of which is altered 
by TGF-P superfamily signal transduction, comprising 

contacting the cell with an amount of an isolated nucleic acid of claim 1 or an isolated 
30 Smad polypeptide expression product thereof, effective to modulate the proliferation of the cell. 



33. A method of increasing TGF-p superfamily signal transduction in a cell comprising 



WO 98/53068 PCT/US98/1 0305 

- 82 - 

contacting the cell with an amount of an agent that selectively binds to an isolated nucleic 
acid molecule of claim 1 or an isolated Smad polypeptide expression product thereof, effective to 
increase the TGF-P superfamily signal transduction in the cell. 

5 34. The method of claim 33 wherein the TGF-P superfamily signal transduction is mediated 
by a TGFp superfamily ligand selected from the group consisting of TGF-pl 5 activin, Vgl, 
BMP-4 and BMP-7. 

35. The method of claim 34, wherein the agent is an antisense nucleic acid. 

10 

36. The method of claim 34, wherein the agent is a polypeptide. 

37. The method of claim 36, wherein the polypeptide is selected from the group consisting of 
(1) an antibody which binds a polypeptide selected from the group consisting of a polypeptide 

15 comprising the amino acid sequence of SEQ ID NO:4, a polypeptide comprising the amino acid 
sequence of SEQ ID NO:6, a polypeptide comprising the amino acid sequence of SEQ ID 
NO: 13, a polypeptide comprising the amino acid sequence of SEQ ID NO: 14, aN-terminal 
fragment of Smad7 and a C-terminal fragment of Smad7, and (2) a dominant negative variant of 
Smad7. 

20 

38. A composition comprising : 

the polypeptide of claim 18, and 

a pharmaceutically acceptable carrier. 

25 39. A method for decreasing Smad7 TGF-P superfamily inhibitory activity in a subject 
comprising 

administering to a subject in need of such treatment an agent that selectively binds to an 
isolated nucleic acid molecule of claim 1 or an expression product thereof, in an amount effective 
to decrease Smad7 TGF-p superfamily inhibitory activity in the subject. 

30 

40. The method of claim 39 wherein the TGF-p superfamily inhibitory activity is inhibition 
of TGF-P superfamily signal transduction mediated by a TGFp superfamily ligand selected from 
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the group consisting of TGF-P 1, activin, Vgl, BMP-4 and BMP-7. 

41 . The method of claim 39, wherein the agent is an antisense nucleic acid. 

5 42. The method of claim 39, wherein the agent is a polypeptide. 

43. The method of claim 42, wherein the polypeptide is selected from the group consisting of 
(1) an antibody which binds a polypeptide selected from the group consisting of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:4, a polypeptide comprising the amino acid 
10 sequence of SEQ ID NO: 6, a polypeptide comprising the amino acid sequence of SEQ ID 
NO: 13, a polypeptide comprising the amino acid sequence of SEQ ID NO: 14, a N-terminal 
fragment of Smad7 and a C-terminal fragment of Smad7, and (2) a dominant negative variant of 
Smad7. 



15 44. A method for diagnosing induction of a TGF-p superfamily ligand in a cell, comprising: 

(a) measuring the amount of Smad7 RNA or polypeptide in the cell, and 

(b) comparing the result of (a) with a control. 



45. A method for determining the presence of a functional TGFp superfamily receptor in a 
20 cell, comprising: 

(a) contacting the cell with an amount of TGFp superfamily ligand effective to increase 
the amount of Smad7 in the cell, 

(b) measuring the amount of Smad7 RNA or polypeptide in the cell, and 

(c) comparing the result of (b) with a control, wherein an increased amount of Smad7 

25 RNA or polypeptide in the cell indicates the presence of a functional TGFp superfamily receptor. 

45B. The method of claim 45, wherein the TGFp superfamily receptor is selected from the 
group consisting of TGF-p superfamily type I receptors, TGF-P superfamily type II receptors, 
and complexes of TGF-P superfamily type I receptors and TGF-P superfamily type II receptors. 



46. A method for identifying lead compounds for a pharmacological agent useful in the 
diagnosis or treatment of disease associated with Smad7 TGF-P superfamily inhibitory activity, 
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comprising 

forming a mixture comprising a Smad7 polypeptide, a TGF-P superfamily receptor 
complex or an activated TGFp superfamily type I receptor, and a candidate pharmacological 
agent, 

5 incubating the mixture under conditions which, in the absence of the candidate 

pharmacological agent, permit a first amount of specific binding of the TGF-P superfamily 
receptor complex or activated TGFp superfamily type I receptor by the Smad7 polypeptide, and 

detecting a test amount of the specific binding of the TGF-P superfamily receptor 
complex or activated TGFp superfamily type I receptor by the Smad7 polypeptide, wherein 

10 reduction of the test amount of specific binding relative to the first amount of specific binding 
indicates that the candidate pharmacological agent is a lead compound for a pharmacological 
agent which disrupts the Smad7 TGF-p superfamily signal transduction inhibitory activity, and 
wherein increase of the test amount of specific binding relative to the first amount of specific 
binding indicates that the candidate pharmacological agent is a lead compound for a 

15 pharmacological agent which enhances the Smad7 TGF-P superfamily signal transduction 
inhibitory activity. 

47. The method of claim 46, wherein members of the TGF-P superfamily are selected from 
the group consisting of TGF-p 1, activin, Vgl, BMP -4 and BMP-7. 

20 

47B. The method of claim 46, wherein the Smad7 polypeptide is selected from the group 
consisting of a polypeptide comprising the amino acid sequence of SEQ ID NO:4, a polypeptide 
comprising the amino acid sequence of SEQ ID NO:6, a N-terminal fragment of Smad7 and a C- 
terminal fragment of Smad7. 

25 

48. A method for identifying lead compounds for a pharmacological agent useful in the 
diagnosis or treatment of disease associated with Smad7 TGF-P superfamily signal transduction 
inhibitory activity, comprising 

forming a mixture comprising a Smad7 polypeptide, a TGF-p superfamily receptor 
30 complex or an activated TGF-p superfamily type I receptor, a pathway specific Smad 
polypeptide and a candidate pharmacological agent, 

incubating the mixture under conditions which, in the absence of the candidate 
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pharmacological agent, permit a first amount of TGF-P superfamily induced phosphorylation of 
the pathway specific Smad polypeptide, and 

detecting a test amount of TGF-P superfamily induced phosphorylation of the pathway 
specific Smad polypeptide, wherein reduction of the test amount of the phosphorylation of the 
5 pathway specific Smad polypeptide relative to the first amount of the phosphorylation of the 
pathway specific Smad polypeptide indicates that the candidate pharmacological agent is a lead 
compound for a pharmacological agent which enhances the TGF-p superfamily signal 
transduction inhibitory activity of Smad7 and wherein increase of the test amount of the 
phosphorylation of the pathway specific Smad polypeptide relative to the first amount of the 
10 phosphorylation of the pathway specific Smad polypeptide indicates that the candidate 

pharmacological agent is a lead compound for a pharmacological agent which disrupts the TGF- 
p superfamily signal transduction inhibitory activity of Smad7. 

48B. The method of claim 48, wherein the pathway specific Smad polypeptide is selected from 
15 the group consisting of Smadl, Smad2, Smad3 and Smad5. 

48C. The method of claim 48, wherein the Smad7 polypeptide is selected from the group 
consisting of a polypeptide comprising the amino acid sequence of SEQ ID NO:4, a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 6, a N-terminal fragment of Smad7 and a C- 
20 terminal fragment of Smad7. 

49. The method of claim 48, wherein members of the TGF-P superfamily are selected from 
the group consisting of TGF-p 1, activin, Vgl, BMP-4 and BMP-7. 

25 50. A method for reducing expression of a Smad6 or Smad7 nucleic acid or expression 
product thereof in a cell, comprising 

contacting the cell with an amount of an agent which binds selectively to a Smad4 nucleic 
acid or polypeptide effective to reduce the expression of the Smad6 or Smad7 nucleic acid or 
expression product thereof in the cell. 

30 

5 1 . The method of claim 50, wherein the agent is an antisense Smad4 molecule. 
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52. The method of claim 50, wherein the agent is an antibody that selectively binds to 
Smad4. 

53. A method for increasing Smad6 or Smad7 expression in a cell, comprising 

5 contacting the cell with an agent selected from the group consisting of activin, epidermal 

growth factor and phorbol esters in an amount effective to increase Smad6 or Smad7 expression 
in the cell. 

54. A method for treating a subject having lung cancer characterized by elevated expression 
10 of a gene selected from the group consisting of Smad6 and Smad7, comprising, 

administering to the subject an amount of an agent which reduces the expression or 
activity of a nucleic acid molecule or an expression product thereof encoded by the gene. 

54B. The method of claim 54, wherein the agent selectively binds the nucleic acid molecule or 
1 5 an expression product thereof. 

54C. The method of claim 54B, wherein the agent is an antisense nucleic acid molecule 

54D. The method of claim 54B, wherein the agent is a polypeptide. 

20 

54E. The method of claim 54D, wherein the polypeptide is an antibody or fragment thereof 
which binds a polypeptide selected from the group consisting of a polypeptide comprising the 
amino acid sequence of SEQ ID NO:4, a polypeptide comprising the amino acid sequence of 
SEQ ID NO:6, a N-terminal fragment of Smad7 and a C-terminal fragment of Smad7, a 
25 polypeptide comprising the amino acid sequence of SEQ ID NO: 10, a polypeptide comprising 
the amino acid sequence of SEQ ID NO:l 1, a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 12, a polypeptide comprising the amino acid sequence of SEQ ID NO: 13, and a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 14. 

30 54F. The method of claim 54B, wherein the agent is a dominant negative variant of Smad6 or 
Smad7. 
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55. A method for reducing eye defects in a developing mammalian embryo, comprising 
contacting the cells of the embryo with an agent which reduces the expression or activity of a 
Smad7 nucleic acid molecule or an expression product thereof. 

5 56. The method of claim 55, wherein the agent selectively binds the Smad7 nucleic acid 
molecule or an expression product thereof. 

57. The method of claim 56, wherein the agent is an antisense nucleic acid molecule 
10 58. The method of claim 56, wherein the agent is a polypeptide. 

59. The method of claim 58, wherein the polypeptide is an antibody or fragment thereof 
which binds a polypeptide selected from the group consisting of a polypeptide comprising the 
amino acid sequence of SEQ ID NO:4, a polypeptide comprising the amino acid sequence of 

15 SEQ ID NO:6, a polypeptide comprising the amino acid sequence of SEQ ID NO: 1 3, a 

polypeptide comprising the amino acid sequence of SEQ ID NO: 14, a N-terminal fragment of 
Smad7 and a C-terminal fragment of Smad7. 

60. The method of claim 56, wherein the agent is a dominant negative variant of Smad7. 

61 . An isolated polypeptide which is localized in the nucleus of a cell having a TGFp 
superfamily receptor and which is exported from the nucleus to the cytoplasm of the cell upon 
TGFP superfamily receptor-mediated signal transduction, comprising 

a first polypeptide or fragment thereof linked to a second polypeptide or fragment thereof, 
wherein the second polypeptide or fragment thereof comprises a MH2 domain of Smad7. 

6 IB. An fusion protein comprising a Smad7 MH2 domain or a nuclear localization fragment 
thereof. 

30 62. A fusion protein comprising a Smad7 MH2 domain or a transcriptional activation 
fragment thereof. 
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63. A method for modulating transcription of Smad7-regulated gene transcription, 
comprising 

contacting a mammalian cell with an agent which modulates TGFp superfamily receptor- 
mediated signal transduction in an amount effective to modulate Smad7-regulated gene 
5 transcription. 

64. The method of claim 63, wherein the agent is a TGFp superfamily ligand or an inhibitor 
of TGFp superfamily receptor-mediated signal transduction. 

10 65. An isolated nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO: 15, an allelic variant thereof, or a functional fragment thereof which confers TGFp 
regulation. 

66. An isolated nucleic acid molecule comprising a unique fragment of SEQ ID NO: 15. 

15 

67. An expression vector comprising the isolated nucleic acid molecule of claim 65 or 
claim 66. 

68. A method for regulating transcription of a first nucleic acid molecule, comprising 
20 preparing a construct comprising the first nucleic acid molecule operably linked to 

the isolated nucleic acid molecule of claim 65 or claim 66, and 
introducing the construct into an expression system. 

69. The method of claim 68, wherein the expression system is a cell. 

25 

70. The method of claim 69, wherein the cell expresses a TGFp receptor, further 
comprising contacting the cell with TGFp to increase expression of the first nucleic acid 
molecule. 



30 



71. A method for identifying modulators of TGFP-regulated transcriptional activity, 
comprising 

providing an expression system with a reporter construct including SEQ ID NO: 1 5 or a 
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TGFP-regulated fragment thereof operably linked to a nucleic acid encoding a detectable 

expression product, 

contacting the expression system with a candidate modulator compound, 
incubating the expression system under conditions which, in the absence of the candidate 
5 modulator, permit a first amount of expression of the detectable expression product, and 

detecting a test amount of the expression the detectable expression product, wherein 

detection of an increase in the expression of the detectable expression product in the presence of 

the candidate modulator compound indicates that the candidate modulator compound is a 

compound which increases TGFp-regulated transcriptional activity, and wherein detection of a 
1 0 decrease in the expression of the detectable expression product in the presence of the candidate 

modulator compound indicates that the candidate modulator compound is a compound which 

decreases TGFp-regulated transcriptional activity. 

72. The method of claim 71, wherein the expression system is a selelcted from the group 
15 consisting of cells and in vitro cell-free transcription systems. 



73. The method of claim 71, wherein the detectable expression product is a reporter protein. 
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